Diaeresis and Umlaut
From FrathWiki
Diaeresis (known as tréma in French) and umlaut both employ the same character, but they differ in use. A letter with an umlaut stands for a completely different sound than its unaccented counterpart. For example, in Swedish Oo represents /u/ while Öö represents /ø/. A diaeresis, on the other hand, does not change the sound value of a letter; instead it marks that a vowel is not part of a diphthong or digraph. Both are also known under the general name trema.
The diaeresis and umlaut characters have different origins. The diaeresis was borrowed from the Greek alphabet,[1] while the umlaut began as a small e placed on top of Aa, Oo or Uu. This e later evolved into the same shape as the diaeresis.[2]
Diaeresis/Umlaut in Unicode
¨ | ◌̈ | Ä | ä | Ǟ | ǟ | Ë | ë | Ḧ | ḧ | Ï | ï | Ḯ |
U+00A8 | U+0308 | U+00C4 | U+00E4 | U+01DE | U+01DF | U+00CB | U+00EB | U+1E26 | U+1E27 | U+00CF | U+00EF | U+1E2E |
Diaeresis | Combining Diaeresis | Latin Capital Letter A With Diaeresis | Latin Small Letter A With Diaeresis | Latin Capital Letter A With Diaeresis And Macron | Latin Small Letter A With Diaeresis And Macron | Latin Capital Letter E With Diaeresis | Latin Small Letter E With Diaeresis | Latin Capital Letter H With Diaeresis | Latin Small Letter H With Diaeresis | Latin Capital Letter I With Diaeresis | Latin Small Letter I With Diaeresis | Latin Capital Letter I With Diaeresis And Acute |
ḯ | Ö | ö | Ȫ | ȫ | Ṏ | ṏ | ẗ | Ü | ü | Ǖ | ǖ | Ǘ |
U+1E2F | U+00D6 | U+00F6 | U+022A | U+022B | U+1E4E | U+1E4F | U+1E97 | U+00DC | U+00FC | U+01D5 | U+01D6 | U+01D7 |
Latin Small Letter I With Diaeresis And Acute | Latin Capital Letter O With Diaeresis | Latin Small Letter O With Diaeresis | Latin Capital Letter O With Diaeresis And Macron | Latin Small Letter O With Diaeresis And Macron | Latin Capital Letter O With Tilde And Diaeresis | Latin Small Letter O With Tilde And Diaeresis | Latin Small Letter T With Diaeresis | Latin Capital Letter U With Diaeresis | Latin Small Letter U With Diaeresis | Latin Capital Letter U With Diaeresis And Macron | Latin Small Letter U With Diaeresis And Macron | Latin Capital Letter U With Diaeresis And Acute |
ǘ | Ǚ | ǚ | Ǜ | ǜ | Ṻ | ṻ | Ẅ | ẅ | Ẍ | ẍ | Ÿ | ÿ |
U+01D8 | U+01D9 | U+01DA | U+01DB | U+01DC | U+1E7A | U+1E7B | U+1E84 | U+1E85 | U+1E8C | U+1E8D | U+0178 | U+00FF |
Latin Small Letter U With Diaeresis And Acute | Latin Capital Letter U With Diaeresis And Caron | Latin Small Letter U With Diaeresis And Caron | Latin Capital Letter U With Diaeresis And Grave | Latin Small Letter U With Diaeresis And Grave | Latin Capital Letter U With Macron And Diaeresis | Latin Small Letter U With Macron And Diaeresis | Latin Capital Letter W With Diaeresis | Latin Small Letter W With Diaeresis | Latin Capital Letter X With Diaeresis | Latin Small Letter X With Diaeresis | Latin Capital Letter Y With Diaeresis | Latin Small Letter Y With Diaeresis |
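The relationship between the precomposed letters in the table and the combining diaeresis U+0308 can be inspected programmatically. The following is a minimal sketch using Python's standard `unicodedata` module; the helper names `describe` and `decompose` are made up for this illustration and are not part of any library. Note that `unicodedata.name` returns names in upper case, whereas the table above shows them in title case.

```python
import unicodedata


def describe(text: str) -> list[tuple[str, str]]:
    """Return (code point, Unicode name) pairs for each character in text."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>")) for ch in text]


def decompose(text: str) -> list[tuple[str, str]]:
    """Describe text after canonical decomposition (NFD)."""
    return describe(unicodedata.normalize("NFD", text))


if __name__ == "__main__":
    # A few letters from the table, written as escapes:
    # U+00E4 (a with diaeresis), U+01DF (a with diaeresis and macron),
    # U+1E97 (t with diaeresis), U+00FF (y with diaeresis).
    for letter in "\u00e4\u01df\u1e97\u00ff":
        print(letter, describe(letter), "->", decompose(letter))
```

Under canonical decomposition, a precomposed letter such as U+00E4 splits into its base letter followed by U+0308, and letters carrying a second diacritic (such as U+01DF) split into the base letter, U+0308 and the additional combining mark.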
Diaeresis/Umlaut in Natlangs
Usage | Language | Letters | Notes |
---|---|---|---|
Central vowel | Albanian (Manastir (current) alphabet) | Ëë /ə/ | |
Central vowel | Kazakh (2019 and 2021 alphabets, as well as Kazinform's romanization) | Üü /ʉ/ | Unaccented Uu stands for /ʊ/ in the 2019 alphabet and Kazinform's romanization, and for /ʊw/ and /w/ in the 2021 alphabet.[3] |
Central vowel | Moro | Ëë /ˈəː/ | This letter represents a "long or stressed ‘ə’",[4] but its phonemicity is contested.[5] The orthography for Moro originally had no capital letters.[4] |
Change of place of articulation | Malagasy | N̈n̈ /ŋ/ | This letter is used in some dialects. It may optionally be replaced by Ññ or Ng ng.[6] Note that N̈n̈ is not a precomposed letter. |
Diphthong | Adyghe (BGN/PCGN 2012 romanization) | Ëë /jo/ | Ëë only appears in Russian loan words, as transliteration of Cyrillic Ёё.[7] |
Diphthong | Kazakh (Kazinform's romanization) | Ïï /əj/ | This romanization also used Iı for /ə/ and İi for /ɪ/.[3] |
Disambiguation of homographs | Tahitian | Ïï /iː/ | This letter is only used in the reflexive pronoun ïa, to distinguish it from another word that is pronounced the same. The letter is barely used nowadays, however.[8] |
Front version of back vowel (Ää is included here, even though its unaccented version is not a back vowel in all of these languages) | Estonian | Ää /æ/, Öö /ø/, Üü /y/ | |
Front version of back vowel | Finnish | Ää /æ/, Öö /ø/ | Usage borrowed from Swedish. |
Front version of back vowel | German | Ää /ɛ/, Öö /ø/, Üü /y/ | The umlaut evolved from the letter e in the digraphs ae, oe and ue. |
Front version of back vowel | Hungarian | Öö /ø/, Üü /y/ | |
Front version of back vowel | Icelandic | Öö /œ/ | |
Front version of back vowel | Kazakh (2019 and 2021 alphabets, as well as Kazinform's romanization) | Ää /æ/, Öö /œ/ | Unaccented Aa and Oo stand for /ɑ/ and /o/ respectively.[3] |
Front version of back vowel | Livonian | Ää /æ/, Ǟǟ /æː/ | |
Front version of back vowel | Mandarin (Pinyin romanization) | Üü /y/, Ǖǖ /y˥/, Ǘǘ /y˧˥/, Ǚǚ /y˨˩˦/, Ǜǜ /y˥˩/ | Üü without tone markings may stand for the so-called neutral tone,[9] or may simply appear because the given text uses no tone marks.[10] Note that these tone values are based on the Beijing dialect.[11] |
Front version of back vowel | Slovak | Ää /æ~ɛ/ | [æ] is a dialectal pronunciation; most speakers merge it with the phoneme /ɛ/ or /a/.[12] |
Front version of back vowel | Swedish | Ää /ɛ/, Öö /ø/, Üü /y/ | The umlaut evolved from the letter e in the digraphs ae[13] and oe.[14] Üü is not really a part of the Swedish alphabet, but is regularly used in some loanwords and surnames. |
Front version of back vowel | Turkish | Öö /œ/, Üü /y/ | Oo and Uu stand for /o/ and /u/, respectively. |
Hiatus | Catalan | Ïï /i/, Üü /u/ | Diaeresis on an Ii or Uu following another vowel marks that the two vowels are in different syllables. Without diaeresis, the Ii or Uu would stand for a semivowel.[15] |
Hiatus | French | Ëë, Ïï, Üü, Ÿÿ | |
Hiatus | Welsh | Ëë /ɛ, eː/, Ïï /ɪ, iː, ij/, Üü /ɨ̞, ɨː, ɪ, iː/, Ẅẅ /ʊ, uː/, Ÿÿ /ɨ̞, ɨː, ɪ, iː, ə, əː/ | The diaeresis marks that a vowel is not part of a diphthong. It is sometimes omitted in casual writing. Ïï stands for /ij/ when it is followed by another vowel.[16] Regarding Üü and Ÿÿ: the realizations /ɨ̞, ɨː, ə/ are used in northern dialects and /ɪ, iː, ə, əː/ in southern dialects. |
Non-silent vowel | Catalan | Üü /w/ | Diaeresis on a Uu that is between Gg or Qq and a front vowel marks that this letter stands for /w/. Otherwise it would be a part of the digraph Gu gu /g/ or Qu qu /k/ that is used before front vowels.[15] |
Non-silent vowel | Spanish | Üü [w] | Diaeresis is used on a Uu between Gg and a front vowel to show that the Uu is not a silent letter, as would otherwise be the case.[17] |
Raised vowel | Hungarian | Ëë /e/ | Unaccented Ee stands for /ɛ/. Ëë is not really a part of the Hungarian alphabet, however; it is used when writing down spoken or sung language in a dialect that has this phoneme. |
Short vowel | Adyghe (BGN/PCGN 2012 romanization) | Ää /a/ | Unaccented Aa represents /aː/.[7] |
Other | Arabic (ISO 233 romanization) | T̈ẗ /a(t)/ | This letter is used for transcribing the Arabic letter ة which is used for a suffix which may or may not include a /t/, depending on context.[18] Note that there is no precomposed form of capital T̈. |
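Several notes in the table point out that a letter has no precomposed form (N̈n̈, capital T̈). One way to check this is to see whether a base letter followed by U+0308 composes into a single code point under NFC normalization. The sketch below uses Python's standard `unicodedata` module; `has_precomposed` is a hypothetical helper written for this illustration, and the check assumes the precomposed form is not on Unicode's composition exclusion list (which holds for the Latin letters discussed here).

```python
import unicodedata

COMBINING_DIAERESIS = "\u0308"


def has_precomposed(base: str, mark: str = COMBINING_DIAERESIS) -> bool:
    """True if base + mark composes into a single code point under NFC."""
    return len(unicodedata.normalize("NFC", base + mark)) == 1


if __name__ == "__main__":
    for base in ["n", "N", "t", "T", "u"]:
        status = "precomposed" if has_precomposed(base) else "needs combining mark"
        print(f"{base + COMBINING_DIAERESIS}: {status}")
```

Letters that fail this check have to be stored as two code points, so text-processing code that counts characters or compares strings should normalize its input first.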
Diaeresis/Umlaut in Phonetic Transcription
Use | Transcription system | Notes |
---|---|---|
Centralized vowel | International Phonetic Alphabet (IPA) | Used for marking that a vowel has (more) centralized place of articulation (than what the base letter implies).[19] |
Diaeresis/Umlaut in Conlangs
Usage | Language | Creator | Letters | Notes |
---|---|---|---|---|
Digraph disambiguation | Lhueslue (external romanization) | Qwynegold | Ëë /e/ | The diaeresis is used when /e/ follows another vowel, and signals that these two vowel letters do not form a digraph. These two vowels are pronounced as a diphthong.[20] |
Front version of back vowel | Qwynegold (Qwadralónia dialect) | Qwynegold | Ää /æ, ɛ/, Ä́ä́ /æˑ, ɛˑ/, Ā̈ā̈ /æː, ɛː/, Öö /ø, œ/, Ö́ö́ /øˑ, œˑ/, Ō̈ō̈ /øː, œː/ | Ä́ä́, Ā̈ā̈, Ö́ö́, Ō̈ō̈ have no precomposed forms. |
Front version of back vowel | Songulda (external romanization) | Qwynegold | Öö /ø/, Üü /y/ | Unaccented Oo, Uu stand for /o, u/.[21] |
Stress | Seebee (external romanization) | Qwynegold | ȷ̈ /ˈj/ | Normally a dot is placed below the first letter of a stressed syllable, but for lowercase j an umlaut is used instead, because there is otherwise no room for a dot either below or above the letter. Note that ȷ̈ is not a precomposed letter, but a combination of dotless ȷ and combining diaeresis. |
References
- [1] Diaeresis, Diaeresis, History at Wikipedia.
- [2] Diaeresis, Umlaut, History at Wikipedia.
- [3] Kazakh alphabets at Wikipedia. See also Kazakh language, Phonology at Wikipedia.
- [4] Guest, Elizabeth. 1997. Moro Phonology.
- [5] Blench, Roger. 2005. A dictionary of the Moro language of the Nuba hills, Sudan.
- [6] Malagasy language, Diacritics at Wikipedia.
- [7] Romanization of Adyghe (PDF).
- [8] Tahitian language, Phonology at Wikipedia. The source is not entirely clear regarding the word ïa, but it seems to be pronounced /iːa/.
- [9] Pinyin, Numerals in place of tone marks at Wikipedia.
- [10] Pinyin at Wikipedia.
- [11] Mandarin Chinese, Tones at Wikipedia.
- [12] Slovak phonology at Wikipedia.
- [13] Ä at Wikipedia.
- [14] Ö at Wikipedia.
- [15] Catalan alphabet, Diaeresis at Wikipedia.
- [16] Welsh orthography at Wikipedia.
- [17] Spanish orthography, Special and modified letters at Wikipedia.
- [18] Taw, Tāʼ marbūṭah at Wikipedia.
- [19] Relative articulation, Centralized at Wikipedia.
- [20] Lhueslue, Romanization at FrathWiki.
- [21] Songulda language, Romanization and pronunciation at FrathWiki.