The wiki has recently been updated. Please contact me by talk page or email if you encounter any issues.

Natlang Uses of Diacritics in the Latin Alphabet: Difference between revisions

From FrathWiki
Jump to navigationJump to search
m (Restructuring the article)
mNo edit summary
 
(65 intermediate revisions by 2 users not shown)
Line 1: Line 1:
{{WIP}}
This is a collection of articles that list different uses of diacritical marks that have natlang precedence. Conlangers can use this to find inspiration for their own conlang's orthography or transliteration. These articles could also be used as reference for those designing a keyboard layout.<br>
This page will list different uses of diacritical marks that have natlang precedence. Conlangers can use this to find inspiration for their own conlang romanizations.<br>
<br>
Note that in this article combining diacritics are attached to a ◌. Diacritics without a ◌, like ¨ for example, are non-combining. Non-combining diacritics are sometimes called modifier letters in Unicode. The non-combining forms may for example be used when writing about a conlang's orthography, when one wants to refer to a diacritic without using any base letter with it.<br>
Conlangs and transcription systems are also included in these articles. If you want to contribute with your own conlangs, or natlang examples, please read first the design guidelines on the [[Talk:Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet|talk page]].
When a letter is referred to without concerning about case, it is displayed like so: Ťť. This is for clarity's sake because some diacritics may look different depending on the letter's case.


{| class="wikitable"
{| class="wikitable"
|+ Diacritics
|+ List of Articles on Natlang Usage of Latin Alphabet Diacritics
! Diacritic name !! Other names !! Character !! Notes
! Diacritic name !! Other names !! Character !! Notes
|-
|-
| [[Natlang_Uses_of_Acute_Accent|Acute accent]] || || style="font-size:180%" | ˊ ||
| [[Acute_Accent|Acute accent]] || Kreska || style="font-size:180%" | ˊ || Although Łł is considered to be an Ll with kreska in Polish typography, this letter is listed under [[Stroke|stroke]].
|-
|-
| [[Natlang_Uses_of_Bar|Bar]] || Stroke, horizontal bar, middle tilde || style="font-size:180%" | ◌̵ || Eth (Ðð) and capital African D (Ɖ) are listed here. See also [[Natlang_Uses_of_Stroke|stroke]].
| [[Acute_Accent_Below|Acute accent below]] || || style="font-size:180%" | ˏ ||
|-
|-
| [[Natlang_Uses_of_Breve|Breve]] || || style="font-size:180%" | ˘ ||
| [[Bar]] || Stroke, horizontal bar, middle tilde || style="font-size:180%" | ◌̵ || Eth (Ðð) and capital African D (Ɖ) are listed here. See also [[Stroke|stroke]].
|-
|-
| [[Natlang_Uses_of_Caron|Caron]] || Háček, haček || style="font-size:180%" | ˇ ||
| [[Breve]] || || style="font-size:180%" | ˘ ||
|-
|-
| [[Natlang_Uses_of_Cedilla|Cedilla]] || || style="font-size:180%" | ¸ || Some of the letters included here have in practice comma below, but Şş and Ţţ are listed under [[Natlang_Uses_of_Comma_Below|comma below]].
| [[Breve_Below|Breve below]] || || style="font-size:180%" | ◌̮ ||
|-
|-
| [[Natlang_Uses_of_Circumflex|Circumflex]] || || style="font-size:180%" | ˆ ||
| [[Candrabindu]] || Chandrabindu, chandravindu, candravindu, chôndrobindu || style="font-size:180%" | ◌̐ ||
|-
|-
| [[Natlang_Uses_of_Comma_Below|Comma below]] || || style="font-size:180%" | ◌̦ || This article includes Şş and Ţţ, but not other letters containing a comma looking diacritic. Instead, see [[Natlang_Uses_of_Cedilla|cedilla]].
| [[Caron]] || Háček, haček || style="font-size:180%" | ˇ ||
|-
|-
| [[Natlang_Uses_of_Diaeresis_and_Umlaut|Diaeresis/umlaut]] || Tréma, trema || style="font-size:180%" | ¨ ||
| [[Cedilla]] || || style="font-size:180%" | ¸ || Some of the letters included here have in practice comma below, but Ss and Tt with comma below are listed under [[Comma_Below|comma below]].
|-
|-
| [[Natlang_Uses_of_Dot_Above|Dot above]] || || style="font-size:180%" | ˙ ||
| [[Circumflex]] || || style="font-size:180%" | ˆ ||
|-
|-
| [[Natlang_Uses_of_Dot_Below|Dot below]] || Underdot || style="font-size:180%" | ◌̣ ||
| [[Circumflex_Below|Circumflex below]] || || style="font-size:180%" | ◌̭ ||
|-
|-
| [[Natlang_Uses_of_Double_Acute_Accent|Double acute accent]] || Hungarumlaut || style="font-size:180%" | ˝ ||
| [[Comma_Above|Comma above]] || Psilòn pneûma, psilon pneuma, psilí, psili, spīritus lēnis, spiritus lenis || style="font-size:180%" | ᾿ || Actually a greek script diacritic, but used in some Latin alphabets.
|-
|-
| [[Natlang_Uses_of_Double_Grave_Accent|Double grave accent]] || || style="font-size:180%" |​ ◌̏ ||
| [[Comma_Above_Right|Comma above right]] || || style="font-size:180%" | ◌̕ ||
|-
|-
| [[Natlang_Uses_of_Grave_Accent|Grave accent]] || || style="font-size:180%" | ˋ ||
| [[Comma_Below|Comma below]] || || style="font-size:180%" | ◌̦ || This article includes Ss and Tt with comma below, but not other letters containing a comma looking diacritic. Instead, see [[Cedilla|cedilla]].
|-
|-
| [[Natlang_Uses_of_Hook_Above|Hook above]] || Dấu hỏi || style="font-size:180%" | ◌̉ ||
| [[Descender]] || || style="font-size:180%" | ||
|-
|-
| [[Natlang_Uses_of_Horn|Horn]] || Dấu móc || style="font-size:180%" | ◌̛ ||
| [[Diaeresis_and_Umlaut|Diaeresis/umlaut]] || Tréma, trema || style="font-size:180%" | ¨ ||
|-
|-
| [[Natlang_Uses_of_Inverted_Breve|Inverted breve]] || Arch || style="font-size:180%" | ◌̑ ||
| [[Diaeresis_Below|Diaeresis below]] || || style="font-size:180%" | ◌̤ ||
|-
|-
| [[Natlang_Uses_of_Macron|Macron]] || || style="font-size:180%" | ˉ ||
| [[Dot_Above|Dot above]] || Overdot, anusvāra, anusvara || style="font-size:180%" | ˙ ||
|-
|-
| [[Natlang_Uses_of_Middle_Dot|Middle dot]] || Interpunct, interpoint, centered dot, centred dot, space dot || style="font-size:180%" | · ||
| [[Dot_Above_Right|Dot above right]] || || style="font-size:180%" | ◌͘ ||
|-
|-
| [[Natlang_Uses_of_Ogonek|Ogonek]] || || style="font-size:180%" | ˛ ||
| [[Dot_Below|Dot below]] || Underdot || style="font-size:180%" | ◌̣ ||
|-
|-
| [[Natlang_Uses_of_Retroflex_Hook|Retroflex hook]] || Hook, tail || style="font-size:180%" | ◌̢ ||
| [[Double_Acute_Accent|Double acute accent]] || Hungarumlaut || style="font-size:180%" | ˝ ||
|-
|-
| [[Natlang_Uses_of_Ring_Above|Ring above]] || || style="font-size:180%" | ˚ ||
| [[Double_Grave_Accent|Double grave accent]] || || style="font-size:180%" |​ ◌̏ ||
|-
|-
| [[Natlang_Uses_of_Ring_Below|Ring below]] || || style="font-size:180%" | ˳ ||
| [[Double_Macron_Below|Double macron below]] || || style="font-size:180%" | ◌͟◌ || This diacritic is very similar to [[Low_Line|low line]].
|-
|-
| [[Natlang_Uses_of_Stroke|Stroke]] || Diagonal stroke, solidus, strikethrough || style="font-size:180%" | ◌̷ || [[Natlang_Uses_of_Bar|Bar]] may also be called stroke. Eth (Ðð) is not listed here, but under [[Natlang_Uses_of_Bar|bar]].
| [[Double_Ring_Below|Double ring below]] || || style="font-size:180%" |​ ◌͚ ||
|-
|-
| [[Natlang_Uses_of_Tilde|Tilde]] || || style="font-size:180%" | ˜ ||
| [[Double_Vertical_Line_Above|Double vertical line above]] || || style="font-size:180%" |​ ◌̎ ||
|-
|-
| [[Natlang_Uses_of_Vertical_Line_Below|Vertical line below]] || || style="font-size:180%" | ˌ ||
| [[Grave_Accent|Grave accent]] || || style="font-size:180%" | ˋ ||
|}
 
== Double Grave Accent ==
{| class="wikitable"
|+ Precomposed Letters with Double Grave Accent
| style="font-size:180%" | ˵ || style="font-size:180%" | ◌̏ || style="font-size:180%" | Ȁ || style="font-size:180%" | ȁ || style="font-size:180%" | Ȅ || style="font-size:180%" | ȅ || style="font-size:180%" | Ȉ || style="font-size:180%" | ȉ || style="font-size:180%" | Ȍ || style="font-size:180%" | ȍ || style="font-size:180%" | Ȑ || style="font-size:180%" | ȑ || style="font-size:180%" | Ȕ
|-
|-
| U+02F5 || U+030F || U+0200 || U+0201 || U+0204 || U+0205 || U+0208 || U+0209 || U+020C || U+020D || U+0210 || U+0211 || U+0214
| [[Grave_Accent_Below|Grave accent below]] || || style="font-size:180%" | ˎ ||
|-
|-
| Modifier Letter Middle Double Grave Accent || Combining Double Grave Accent || Latin Capital Letter A With Double Grave || Latin Small Letter A With Double Grave || Latin Capital Letter E With Double Grave || Latin Small Letter E With Double Grave || Latin Capital Letter I With Double Grave || Latin Small Letter I With Double Grave || Latin Capital Letter O With Double Grave || Latin Small Letter O With Double Grave || Latin Capital Letter R With Double Grave || Latin Small Letter R With Double Grave || Latin Capital Letter U With Double Grave
| [[Hook_Above|Hook above]] || Dấu hỏi || style="font-size:180%" | ◌̉ ||
|-
|-
| style="font-size:180%" | ȕ
| [[Horn]] || Dấu móc || style="font-size:180%" | ◌̛ ||
|-
|-
| U+0215
| [[Inverted_Breve|Inverted breve]] || Arch || style="font-size:180%" | ◌̑ ||
|-
|-
| Latin Small Letter U With Double Grave
| [[Low_Line|Low line]] || Underline, underscore || style="font-size:180%" | ◌̲ || This diacritic is very similar to [[Macron_Below|macron below]] and [[Double_Macron_Below|double macron below]].
|}
The double grave accent is not part of the orthography of any language, but it is used in language learning materials of and linguistic publications about [[Wikipedia:Serbian_language|Serbian]], [[Wikipedia:Croatian_language|Croatian]] and [[Wikipedia:Slovene_language|Slovene]].[http://en.wikipedia.org/wiki/Double_grave_accent]
 
{| class="wikitable"
|+ Uses of Double Grave Accent
! Usage
! Language
! Letters
! Notes
|-
|-
| rowspan=2 | Short vowel with pitch accent or tone
| [[Macron]] || || style="font-size:180%" | ˉ ||
| [[Wikipedia:Croatian_language|Croatian]], [[Wikipedia:Serbian_language|Serbian]]
| Ȁȁ /â/, Ȅȅ /ê/, Ȉȉ /î/, Ȍȍ /ô/, Ȑȑ /r̩̂/, Ȕȕ /û/
| The double grave accent marks here a short vowel with falling pitch.[http://en.wikipedia.org/wiki/Serbo-Croatian_phonology#Pitch_accent]
|-
|-
| [[Wikipedia:Slovene_language|Slovene]] (orthography with tonal accentuation)
| [[Macron_Below|Macron below]] || Line below, low macron || style="font-size:180%" | ˍ || See also [[Double_Macron_Below|double macron below]].
| Ȁȁ /à/, Ȅȅ /ɛ̀/, Ẹ̏ẹ̏ /è/, Ȉȉ /ì/, Ȍȍ /ɔ̀/, Ọ̏ọ̏ /ò/, Ȑȑ /ə̀ɾ/, Ȕȕ /ù/
| The double grave accent marks that the vowels are short and have low tone. These letters are not used in the standard orthography of Slovene, but in language materials.[http://en.wikipedia.org/wiki/Slovene_language#Prosody]
|}
 
== Grave Accent ==
{| class="wikitable"
|+ Precomposed Letters with Grave Accent
| style="font-size:180%" | ` || style="font-size:180%" | ˋ || style="font-size:180%" | ˴ || style="font-size:180%" | ◌̀ || style="font-size:180%" | ◌̀ || style="font-size:180%" | À || style="font-size:180%" | à || style="font-size:180%" | Ầ || style="font-size:180%" | ầ || style="font-size:180%" | Ằ || style="font-size:180%" | ằ || style="font-size:180%" | È || style="font-size:180%" | è
|-
|-
| U+0060 || U+02CB || U+02F4 || U+0300 || U+0340 || U+00C0 || U+00E0 || U+1EA6 || U+1EA7 || U+1EB0 || U+1EB1 || U+00C8 || U+00E8
| [[Middle_Dot|Middle dot]] || Interpunct, interpoint, centered dot, centred dot, space dot || style="font-size:180%" | · ||
|-
|-
| Grave Accent || Modifier Letter Grave Accent || Modifier Letter Middle Grave Accent || Combining Grave Accent || Combining Grave Tone Mark || Latin Capital Letter A With Grave || Latin Small Letter A With Grave || Latin Capital Letter A With Circumflex And Grave || Latin Small Letter A With Circumflex And Grave || Latin Capital Letter A With Breve And Grave || Latin Small Letter A With Breve And Grave || Latin Capital Letter E With Grave || Latin Small Letter E With Grave
| [[Ogonek]] || || style="font-size:180%" | ˛ ||
|-
|-
| style="font-size:180%" | Ḕ || style="font-size:180%" | || style="font-size:180%" | Ề || style="font-size:180%" | || style="font-size:180%" | Ì || style="font-size:180%" | ì || style="font-size:180%" | Ǹ || style="font-size:180%" | ǹ || style="font-size:180%" | Ò || style="font-size:180%" | ò || style="font-size:180%" | Ṑ || style="font-size:180%" | ṑ || style="font-size:180%" | Ồ
| [[Palatalized_Hook|Palatalized hook]] || Palatal hook || style="font-size:180%" | ◌̡ ||
|-
|-
| U+1E14 || U+1E15 || U+1EC0 || U+1EC1 || U+00CC || U+00EC || U+01F8 || U+01F9 || U+00D2 || U+00F2 || U+1E50 || U+1E51 || U+1ED2
| [[Retroflex_Hook|Retroflex hook]] || Hook, tail || style="font-size:180%" | ◌̢ ||
|-
|-
| Latin Capital Letter E With Macron And Grave || Latin Small Letter E With Macron And Grave || Latin Capital Letter E With Circumflex And Grave || Latin Small Letter E With Circumflex And Grave || Latin Capital Letter I With Grave || Latin Small Letter I With Grave || Latin Capital Letter N With Grave || Latin Small Letter N With Grave || Latin Capital Letter O With Grave || Latin Small Letter O With Grave || Latin Capital Letter O With Macron And Grave || Latin Small Letter O With Macron And Grave || Latin Capital Letter O With Circumflex And Grave
| [[Right_Half_Ring|Right half ring]] || || style="font-size:180%" | ʾ ||
|-
|-
| style="font-size:180%" | ồ || style="font-size:180%" | || style="font-size:180%" | ờ || style="font-size:180%" | Ù || style="font-size:180%" | ù || style="font-size:180%" | Ǜ || style="font-size:180%" | ǜ || style="font-size:180%" | Ừ || style="font-size:180%" | ừ || style="font-size:180%" | Ẁ || style="font-size:180%" | ẁ || style="font-size:180%" | Ỳ || style="font-size:180%" | ỳ
| [[Ring_Above|Ring above]] || || style="font-size:180%" | ˚ ||
|-
|-
| U+1ED3 || U+1EDC || U+1EDD || U+00D9 || U+00F9 || U+01DB || U+01DC || U+1EEA || U+1EEB || U+1E80 || U+1E81 || U+1EF2 || U+1EF3
| [[Ring_Below|Ring below]] || || style="font-size:180%" | ˳ ||
|-
|-
| Latin Small Letter O With Circumflex And Grave || Latin Capital Letter O With Horn And Grave || Latin Small Letter O With Horn And Grave || Latin Capital Letter U With Grave || Latin Small Letter U With Grave || Latin Capital Letter U With Diaeresis And Grave || Latin Small Letter U With Diaeresis And Grave || Latin Capital Letter U With Horn And Grave || Latin Small Letter U With Horn And Grave || Latin Capital Letter W With Grave || Latin Small Letter W With Grave || Latin Capital Letter Y With Grave || Latin Small Letter Y With Grave
| [[Stroke]] || Diagonal stroke, solidus, strikethrough || style="font-size:180%" | ◌̷ || [[Bar]] may also be called stroke. Eth (Ðð) is not listed here, but under [[Bar|bar]].
|}
 
{| class="wikitable"
|+ Uses of Grave Accent
! Use
! Language
! Letters
! Notes
|-
|-
| Falling tone
| [[Tilde]] || || style="font-size:180%" | ˜ ||
| [[Wikipedia:Vietnamese_language|Vietnamese]]
| Àà /a̤ː˨˩/, Ằằ /a̤˨˩/, Ầầ /ə̤˨˩/, Èè /ɛ̤˨˩/, Ềề /e̤˨˩/, Ìì /i̤˨˩/, Òò /ɔ̤˨˩/, Ồồ /o̤˨˩/, Ờờ /ə̤ː˨˩/, Ùù /ṳ˨˩/, Ừừ /ɨ̤˨˩/, Ỳỳ /i̤˨˩/
| The grave accent stands for low falling tone with breathy voice. There are many exceptions to the phonemic values of these letters though.[http://en.wikipedia.org/wiki/Vietnamese_orthography#Pronunciation]
|-
|-
| rowspan=3 | Short vowel
| [[Tilde_Below|Tilde below]] || || style="font-size:180%" | ˷ ||
| [[Wikipedia:Croatian_language|Croatian]], [[Wikipedia:Serbian_language|Serbian]]
| Àà /ǎ/, Èè /ě/, Ìì /ǐ/, Òò /ǒ/, R̀r̀ /ř̩/, Ùù /ǔ/
| The grave accent marks that these vowels are short and have rising pitch. These letters are not used in the standard orthography of Croatian or Serbian, but in linguistic materials.[http://en.wikipedia.org/wiki/Serbo-Croatian_phonology#Pitch_accent]
|-
|-
| [[Wikipedia:Slovene_language|Slovene]] (orthography with dynamic accentuation)
| [[Tilde_Overlay|Tilde overlay]] || || style="font-size:180%" | ◌̴ ||
| Àà /ˈa/, Èè /ˈɛ/, Ìì /ˈi/, Òò /ˈɔ/, R̀r̀ /ˈəɾ/, Ùù /ˈu/
| The grave accent marks that these vowels are stressed and short, and that Èè and Òò are mid-open vowels rather than mid-close. These letters are not used in the standard orthography of Slovene, but in language materials.[http://en.wikipedia.org/wiki/Slovene_language#Prosody]
|-
|-
| [[Wikipedia:Slovene_language|Slovene]] (orthography with tonal accentuation)
| [[Vertical_Line_Above|Vertical line above]] || || style="font-size:180%" | ˈ ||
| Àà /á/, Èè /ɛ́/, Ẹ̀ẹ̀ /é/, Ìì /í/, Òò /ɔ́/, Ọ̀ọ̀ /ó/, R̀r̀ /ə́ɾ/, Ùù /ú/
| The grave accent marks that these vowels are short and have high pitch. These letters are not used in the standard orthography of Slovene, but in language materials.[http://en.wikipedia.org/wiki/Slovene_language#Prosody]
|-
|-
| Stress
| [[Vertical_Line_Below|Vertical line below]] || || style="font-size:180%" | ˌ ||
| [[Wikipedia:Catalan_language|Catalan]]
| Àà /ˈa/, Èè /ˈɛ/, Òò /ˈɔ/
| The rules for when stress is to be marked in Catalan are quite complex. The grave accent also distinguishes stressed /ɛ ɔ/ from /e o/,[http://en.wikipedia.org/wiki/Catalan_alphabet#Acute_and_grave_accents] see [[Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet#Acute_Accent|Acute Accent]], Catalan section on ''Uses of Acute Accent''.
|}
 
== Hook Above ==
{| class="wikitable"
|+ Precomposed Letters with Hook Above
| style="font-size:180%" | ◌̉ || style="font-size:180%" | Ả || style="font-size:180%" | ả || style="font-size:180%" | Ẩ || style="font-size:180%" | ẩ || style="font-size:180%" | Ẳ || style="font-size:180%" | ẳ || style="font-size:180%" | Ẻ || style="font-size:180%" | ẻ || style="font-size:180%" | Ể || style="font-size:180%" | ể || style="font-size:180%" | Ỉ || style="font-size:180%" | ỉ
|-
| U+0309 || U+1EA2 || U+1EA3 || U+1EA8 || U+1EA9 || U+1EB2 || U+1EB3 || U+1EBA || U+1EBB || U+1EC2 || U+1EC3 || U+1EC8 || U+1EC9
|-
| Combining Hook Above || Latin Capital Letter A With Hook Above || Latin Small Letter A With Hook Above || Latin Capital Letter A With Circumflex And Hook Above || Latin Small Letter A With Circumflex And Hook Above || Latin Capital Letter A With Breve And Hook Above || Latin Small Letter A With Breve And Hook Above || Latin Capital Letter E With Hook Above || Latin Small Letter E With Hook Above || Latin Capital Letter E With Circumflex And Hook Above || Latin Small Letter E With Circumflex And Hook Above || Latin Capital Letter I With Hook Above || Latin Small Letter I With Hook Above
|-
| style="font-size:180%" | Ỏ || style="font-size:180%" | ỏ || style="font-size:180%" | Ổ || style="font-size:180%" | ổ || style="font-size:180%" | Ở || style="font-size:180%" | ở || style="font-size:180%" | Ủ || style="font-size:180%" | ủ || style="font-size:180%" | Ử || style="font-size:180%" | || style="font-size:180%" | || style="font-size:180%" | ỷ
|-
|-
| U+1ECE || U+1ECF || U+1ED4 || U+1ED5 || U+1EDE || U+1EDF || U+1EE6 || U+1EE7 || U+1EEC || U+1EED || U+1EF6 || U+1EF7
| [[Vertical_Tilde|Vertical tilde]] || || style="font-size:180%" | ◌̾ ||
|-
| Latin Capital Letter O With Hook Above || Latin Small Letter O With Hook Above || Latin Capital Letter O With Circumflex And Hook Above || Latin Small Letter O With Circumflex And Hook Above || Latin Capital Letter O With Horn And Hook Above || Latin Small Letter O With Horn And Hook Above || Latin Capital Letter U With Hook Above || Latin Small Letter U With Hook Above || Latin Capital Letter U With Horn And Hook Above || Latin Small Letter U With Horn And Hook Above || Latin Capital Letter Y With Hook Above || Latin Small Letter Y With Hook Above
|}
|}
The hook above (or ''dấu hỏi'') is only used in the Vietnamese Quốc Ngữ orthography, where it marks a tone. Note that this diacritic can be hard to see or discern from other diacritics, especially in small font sizes. When it is stacked on top of another diacritic, it may be cut off in some applications, especially when it appears on a capital letter.


{| class="wikitable"
== Layout Overview ==
|+ Uses of Hook Above
[[File:Caron_article.png|thumb|1138px|An example of a Unicode table, from the article [[Natlang_Uses_of_Caron|Natlang Uses of Caron]]. Notice character similarity warnings both in the article text above, and as a note in the table itself.]]
! Usage
In these articles combining (non-spacing) diacritics are attached to a ◌. Diacritics without a ◌, like ¨ for example, are non-combining (spacing). Non-combining diacritics are sometimes called modifier letters in Unicode. The non-combining forms may for example be used when writing about a conlang's orthography, when one wants to refer to a diacritic without using any base letter with it. Some natlangs even use some diacritics as stand alone characters!<br>
! Language
<br>
! Letters
When a letter is referred to without concerning about case, it is displayed like so: Ťť. This is for clarity's sake because some diacritics may look different depending on the letter's case, as in the previous example. When either only an upper case or a lower case letter is used in an article, it usually refers to that specific case variant. But it can also refer to a character which has only one case.<br>
! Notes
<br>
|-
Sometimes it may be necessary to refer to a digraph, for example Ŀl in Catalan. When a digraph is referenced to without concerning about case, it is written like this: Ŀl ŀl; with a space between the letters. Different languages' orthographies may have different rules about capitalization of the first letter of a word. In most languages, only the first letter of a digraph is capitalized; but there are languages where both letters are capitalized. Which rule a particular orthography, that is examplified in these articles, follows, can thus be discerned from how the article writes the digraph.<br>
| Falling-rising (dipping) tone
<br>
| [[Wikipedia:Vietnamese_language|Vietnamese]]
These articles show first which precomposed letter plus diacritic combinations exist in Unicode, and what their codepoints and Unicode names are. The different forms of the stand alone diacritics are also shown. For example the [[Tilde|tilde]] has three different forms: An "ASCII form" ~, which is used in programming among other things, where the tilde is centered; a non-combining diacritic form ˜, where the tilde has the same position it would have when combined with a base letter; and a combining form ◌̃.<br>
| Ảả /aː˧˩˧/, Ẳẳ /a˧˩˧/, Ẩẩ /ə˧˩˧/, Ẻẻ /ɛ˧˩˧/, Ểể /e˧˩˧/, Ỉỉ /i˧˩˧/, Ỏỏ /ɔ˧˩˧/, Ổổ /o˧˩˧/, Ởở /əː˧˩˧/, Ủủ /u˧˩˧/, Ửử /ɨ˧˩˧/, Ỷỷ /i˧˩˧/
<br>
| There are many exceptions to the phonemic values of these letters.[http://en.wikipedia.org/wiki/Vietnamese_orthography#Pronunciation]
Many diacritics or accented letters look very similar to other characters, for example [[Caron|caron]] ˇ and [[Breve|breve]] ˘. These cases are warned about either in the text at the beginning of the article, or in notes at the table that lists the precomposed characters. It is desirable that all the diacritics in one orthography can be easily told apart, so conlangers devising new orthographies should be careful about this. A conlanger may also mistakenly copypaste a similar looking but wrong character from somewhere to a conlang project, so thereful the articles also list characters that would otherwise be unlikely to normally appear in the same orthography, such as Latin Capital Letter O With Stroke, Ø (U+00D8); and Empty Set, ∅ (U+2205) for example. Cases such as the one with caron ˇ and breve ˘, which concern essentially all characters with this accent, are notified about in the text at the beginning of the article. Cases such as Ø and ∅, which only concern an individual pair of characters, are notified about in the Unicode table.
|}
 
== Horn ==
{| class="wikitable"
|+ Precomposed Letters with Horn
| style="font-size:180%" | ◌̛ || style="font-size:180%" | Ơ || style="font-size:180%" | ơ || style="font-size:180%" | Ớ || style="font-size:180%" | ớ || style="font-size:180%" | Ờ || style="font-size:180%" | ờ || style="font-size:180%" | Ở || style="font-size:180%" | ở || style="font-size:180%" | Ỡ || style="font-size:180%" | ỡ || style="font-size:180%" | Ợ || style="font-size:180%" | ợ
|-
| U+031B || U+01A0 || U+01A1 || U+1EDA || U+1EDB || U+1EDC || U+1EDD || U+1EDE || U+1EDF || U+1EE0 || U+1EE1 || U+1EE2 || U+1EE3
|-
| Combining Horn || Latin Capital Letter O With Horn || Latin Small Letter O With Horn || Latin Capital Letter O With Horn And Acute || Latin Small Letter O With Horn And Acute || Latin Capital Letter O With Horn And Grave || Latin Small Letter O With Horn And Grave || Latin Capital Letter O With Horn And Hook Above || Latin Small Letter O With Horn And Hook Above || Latin Capital Letter O With Horn And Tilde || Latin Small Letter O With Horn And Tilde || Latin Capital Letter O With Horn And Dot Below || Latin Small Letter O With Horn And Dot Below
|-
| style="font-size:180%" | Ư || style="font-size:180%" | ư || style="font-size:180%" | Ứ || style="font-size:180%" | ứ || style="font-size:180%" | Ừ || style="font-size:180%" | ừ || style="font-size:180%" | Ử || style="font-size:180%" | ử || style="font-size:180%" | Ữ || style="font-size:180%" | ữ || style="font-size:180%" | Ự || style="font-size:180%" | ự
|-
| U+01AF || U+01B0 || U+1EE8 || U+1EE9 || U+1EEA || U+1EEB || U+1EEC || U+1EED || U+1EEE || U+1EEF || U+1EF0 || U+1EF1
|-
| Latin Capital Letter U With Horn || Latin Small Letter U With Horn || Latin Capital Letter U With Horn And Acute || Latin Small Letter U With Horn And Acute || Latin Capital Letter U With Horn And Grave || Latin Small Letter U With Horn And Grave || Latin Capital Letter U With Horn And Hook Above || Latin Small Letter U With Horn And Hook Above || Latin Capital Letter U With Horn And Tilde || Latin Small Letter U With Horn And Tilde || Latin Capital Letter U With Horn And Dot Below || Latin Small Letter U With Horn And Dot Below
|}
The horn diacritic (or ''dấu móc'') is only used in the Vietnamese Quốc Ngữ orthography, where it is used for discerning different vowel qualities.
 
{| class="wikitable"
|+ Uses of Horn
! Usage
! Language
! Letters
! Notes
|-
| Unrounded central vowel
| [[Wikipedia:Vietnamese_language|Vietnamese]]
| Ơơ /əː˧/, Ớớ /əː˧˥/, Ờờ /ə̤ː˨˩/, Ởở /əː˧˩˧/, Ỡỡ /əˀː˧˥/, Ợợ /ə̰ːʔ˨˩/, Ưư /ɨ˧/, Ứứ /ɨ˧˥/, Ừừ /ɨ̤˨˩/, Ửử /ɨ˧˩˧/, Ữữ /ɨˀ˧˥/, Ựự /ɨ̰ʔ˨˩/
|
|}
 
== Inverted Breve ==
{| class="wikitable"
|+ Precomposed Letters with Inverted Breve
| style="font-size:180%" | ◌̑ || style="font-size:180%" | Ȃ || style="font-size:180%" | ȃ || style="font-size:180%" | Ȇ || style="font-size:180%" | ȇ || style="font-size:180%" | Ȋ || style="font-size:180%" | ȋ || style="font-size:180%" | Ȏ || style="font-size:180%" | ȏ || style="font-size:180%" | Ȓ || style="font-size:180%" | ȓ || style="font-size:180%" | Ȗ || style="font-size:180%" | ȗ
|-
| U+0311 || U+0202 || U+0203 || U+0206 || U+0207 || U+020A || U+020B || U+020E || U+020F || U+0212 || U+0213 || U+0216 || U+0217
|-
| Combining Inverted Breve || Latin Capital Letter A With Inverted Breve || Latin Small Letter A With Inverted Breve || Latin Capital Letter E With Inverted Breve || Latin Small Letter E With Inverted Breve || Latin Capital Letter I With Inverted Breve || Latin Small Letter I With Inverted Breve || Latin Capital Letter O With Inverted Breve || Latin Small Letter O With Inverted Breve || Latin Capital Letter R With Inverted Breve || Latin Small Letter R With Inverted Breve || Latin Capital Letter U With Inverted Breve || Latin Small Letter U With Inverted Breve
|}
 
The inverted breve is also known as an arch. Note that it may easily be confused with [[Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet#Circumflex|circumflex]]. The inverted breve is not part of the orthography of any language, but it is used in linguistic materials about Serbian, Croatian and Slovene.[http://en.wikipedia.org/wiki/Inverted_breve] It was derived from the circumflex in Ancient Greek.[http://en.wikipedia.org/wiki/Inverted_breve#Serbo-Croatian]
 
{| class="wikitable"
|+ Uses of Inverted Breve
! Usage
! Language
! Letters
! Notes
|-
| rowspan=2 | Long vowel with pitch accent
| [[Wikipedia:Croatian_language|Croatian]], [[Wikipedia:Serbian_language|Serbian]]
| Ȃȃ /âː/, Ȇȇ /êː/, Ȋȋ /îː/, Ȏȏ /ôː/, Ȓȓ /r̩̂ː/, Ȗȗ /ûː/
| The inverted breve marks a long vowel with falling pitch. These letters are not used in the standard orthography of Croatian or Serbian, but in linguistic materials.[http://en.wikipedia.org/wiki/Inverted_breve#Serbo-Croatian]
|-
| [[Wikipedia:Slovene_language|Slovene]] (orthography with tonal accentuation)
| Ȃȃ /àː/, Ȇȇ /ɛ̀ː/, Ẹ̑ẹ̑ /èː/, Ȋȋ /ìː/, Ȏȏ /ɔ̀ː/, Ọ̑ọ̑ /òː/, Ȗȗ /ùː/
| The inverted breve marks a long vowel with low pitch. [[Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet#Circumflex|Circumflex]] may be used instead of the inverted breve. These letters are not used in the standard orthography of Slovene, but in language materials.[http://en.wikipedia.org/wiki/Slovene_language#Prosody]
|}
 
== Macron ==
{| class="wikitable"
|+ Precomposed Letters with Macron
| style="font-size:180%" | ¯ || style="font-size:180%" | ˉ || style="font-size:180%" | ◌̄ || style="font-size:180%" | Ā || style="font-size:180%" | ā || style="font-size:180%" | Ǟ || style="font-size:180%" | ǟ || style="font-size:180%" | Ǡ || style="font-size:180%" | ǡ || style="font-size:180%" | Ǣ || style="font-size:180%" | ǣ || style="font-size:180%" | Ē || style="font-size:180%" | ē
|-
| U+00AF || U+02C9 || U+0304 || U+0100 || U+0101 || U+01DE || U+01DF || U+01E0 || U+01E1 || U+01E2 || U+01E3 || U+0112 || U+0113
|-
| Macron || Modifier Letter Macron || Combining Macron || Latin Capital Letter A With Macron || Latin Small Letter A With Macron || Latin Capital Letter A With Diaeresis And Macron || Latin Small Letter A With Diaeresis And Macron || Latin Capital Letter A With Dot Above And Macron || Latin Small Letter A With Dot Above And Macron || Latin Capital Letter Ae With Macron || Latin Small Letter Ae With Macron || Latin Capital Letter E With Macron || Latin Small Letter E With Macron
|-
| colspan="3" | '''Note:''' May be confused with Overline, ‾ (U+203E); Combining Double Macron, ◌͞ (U+035E); or Superscript Minus, ⁻ (U+207B). || || || || || || || || || ||
|-
| style="font-size:180%" | Ḕ || style="font-size:180%" | ḕ || style="font-size:180%" | Ḗ || style="font-size:180%" | ḗ || style="font-size:180%" | Ḡ || style="font-size:180%" | ḡ || style="font-size:180%" | Ī || style="font-size:180%" | ī || style="font-size:180%" | Ḹ || style="font-size:180%" | ḹ || style="font-size:180%" | Ō || style="font-size:180%" | ō || style="font-size:180%" | Ǭ
|-
| U+1E14 || U+1E15 ||​ U+1E16 || U+1E17 || U+1E20 || U+1E21 || U+012A || U+012B || U+1E38 || U+1E39 || U+014C ||​ U+014D || U+01EC
|-
| Latin Capital Letter E With Macron And Grave || Latin Small Letter E With Macron And Grave || Latin Capital Letter E With Macron And Acute || Latin Small Letter E With Macron And Acute || Latin Capital Letter G With Macron || Latin Small Letter G With Macron || Latin Capital Letter I With Macron || Latin Small Letter I With Macron || Latin Capital Letter L With Dot Below And Macron || Latin Small Letter L With Dot Below And Macron || Latin Capital Letter O With Macron || Latin Small Letter O With Macron || Latin Capital Letter O With Ogonek And Macron
|-
| style="font-size:180%" | ǭ || style="font-size:180%" | Ṑ || style="font-size:180%" | ṑ || style="font-size:180%" | Ṓ || style="font-size:180%" | ṓ || style="font-size:180%" | Ȫ || style="font-size:180%" | ȫ || style="font-size:180%" | Ȭ || style="font-size:180%" | ȭ || style="font-size:180%" | Ȱ || style="font-size:180%" | ȱ || style="font-size:180%" | Ṝ || style="font-size:180%" | ṝ
|-
| U+01ED || U+1E50 || U+1E51 || U+1E52 || U+1E53 || U+022A || U+022B || U+022C || U+022D || U+0230 || U+0231 || U+1E5C || U+1E5D
|-
| Latin Small Letter O With Ogonek And Macron || Latin Capital Letter O With Macron And Grave || Latin Small Letter O With Macron And Grave || Latin Capital Letter O With Macron And Acute || Latin Small Letter O With Macron And Acute || Latin Capital Letter O With Diaeresis And Macron || Latin Small Letter O With Diaeresis And Macron || Latin Capital Letter O With Tilde And Macron || Latin Small Letter O With Tilde And Macron || Latin Capital Letter O With Dot Above And Macron || Latin Small Letter O With Dot Above And Macron || Latin Capital Letter R With Dot Below And Macron || Latin Small Letter R With Dot Below And Macron
|-
| style="font-size:180%" | Ū || style="font-size:180%" | ū || style="font-size:180%" | Ṻ || style="font-size:180%" | ṻ || style="font-size:180%" | Ǖ || style="font-size:180%" | ǖ || style="font-size:180%" | Ȳ || style="font-size:180%" | ȳ
|-
| U+016A || U+016B || U+1E7A || U+1E7B || U+01D5 || U+01D6 || U+0232 || U+0233
|-
| Latin Capital Letter U With Macron || Latin Small Letter U With Macron || Latin Capital Letter U With Macron And Diaeresis || Latin Small Letter U With Macron And Diaeresis || Latin Capital Letter U With Diaeresis And Macron || Latin Small Letter U With Diaeresis And Macron || Latin Capital Letter Y With Macron || Latin Small Letter Y With Macron
|}
 
{| class="wikitable"
|+ Uses of Macron
! Use
! Language
! Letters
! Notes
|-
| rowspan="4" | Long vowel
| [[Wikipedia:Croatian_language|Croatian]], [[Wikipedia:Serbian_language|Serbian]]
| Āā /aː/, Ēē /eː/, Īī /iː/, Ōō /oː/, R̄r̄ /r̩ː/, Ūū /uː/
| The macron marks a long vowel without pitch accent. These letters are not used in the standard orthography of Croatian or Serbian, but in linguistic materials.[http://en.wikipedia.org/wiki/Serbo-Croatian_phonology#Pitch_accent]
|-
| [[Wikipedia:Latgalian_language|Latgalian]]
| Āā /ɑː/, Ēē /eː/, Īī /iː/, Ōō /oː/, Ūū /uː/
|
|-
| [[Wikipedia:Latvian_language|Latvian]]
| Āā /ɑː/, Ēē /eː/ and /æː/, Īī /iː/, Ūū /uː/
|
|-
| [[Wikipedia:Livonian_language|Livonian]]
| Āā /ɑː/, Ǟǟ /æː/, Ēē /ɛː/, Īī /iː/, Ōō /oː/, Ȱȱ /ʊː/, Ȭȭ /ɨː/, Ūū /u/
|
|}
 
== Middle Dot ==
{| class="wikitable"
|+ Precomposed Letters with Middle Dot
| style="font-size:180%" | · || style="font-size:180%" | Ŀ || style="font-size:180%" | ŀ
|-
| U+00B7 || U+013F || U+0140
|-
| Middle Dot || Latin Capital Letter L With Middle Dot || Latin Small Letter L With Middle Dot
|-
| '''Note:''' May be confused with Modifier Letter Half Triangular Colon, ˑ (U+02D1); Bullet, • (U+2022); Bullet Operator, ∙ (U+2219); Dot Operator, ⋅ (U+22C5); or Hyphenation Point, ‧ (U+2027). || ||
|}
The middle dot is also known by the names interpunct, interpoint, centered (centred) dot and space dot. The dot operator ⋅ (U+22C5) may also be called middle dot.[[http://en.wikipedia.org/wiki/Interpunct]<br>
In ancient Latin and several other scripts, the middle dot was used instead of space for separating words.[http://en.wikipedia.org/wiki/Word_divider]
 
{| class="wikitable"
|+ Uses of Middle Dot
! Usage
! Language
! Letters
! Notes
|-
| Digraph disambiguation
| [[Wikipedia:Catalan_language|Catalan]]
| Ŀl ŀl /lː/
| Ll ll without middle dot stands for /ʎ/.[http://en.wikipedia.org/wiki/Catalan_alphabet#Punt_volat_.28middot.29]
|}
 
== Ogonek ==
{| class="wikitable"
|+ Precomposed Letters with Ogonek
| style="font-size:180%" | ˛ || style="font-size:180%" | ◌̨ || style="font-size:180%" | Ą || style="font-size:180%" | ą || style="font-size:180%" | Ę || style="font-size:180%" | ę || style="font-size:180%" | Į || style="font-size:180%" | į || style="font-size:180%" | Ǫ || style="font-size:180%" | ǫ || style="font-size:180%" | Ǭ || style="font-size:180%" | ǭ || style="font-size:180%" | Ų
|-
| U+02DB || U+0328 || U+0104 || U+0105 || U+0118 || U+0119 || U+012E || U+012F || U+01EA || U+01EB || U+01EC || U+01ED || U+0172
|-
| Ogonek || Combining Ogonek || Latin Capital Letter A With Ogonek || Latin Small Letter A With Ogonek || Latin Capital Letter E With Ogonek || Latin Small Letter E With Ogonek || Latin Capital Letter I With Ogonek || Latin Small Letter I With Ogonek || Latin Capital Letter O With Ogonek || Latin Small Letter O With Ogonek || Latin Capital Letter O With Ogonek And Macron || Latin Small Letter O With Ogonek And Macron || Latin Capital Letter U With Ogonek
|-
| style="font-size:180%" | ų
|-
| U+0173
|-
| Latin Small Letter U With Ogonek
|}
In European languages the ogonek is attached to the right side of Aa, Ee and u, but in Native American languages it is supposed to be placed directly under the letter if technically possible.[http://en.wikipedia.org/wiki/Ogonek#Typographical_notes] There are no separate Unicode poins for these variants. Note that the ogonek may be confused with [[Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet#Cedilla|cedilla]] ¸.
 
{| class="wikitable"
|+ Uses of Ogonek
! Usage
! Language
! Letters
! Notes
|-
|
|
|
|
|}
 
== Retroflex Hook ==
{| class="wikitable"
|+ Precomposed Letters with Retroflex Hook
| style="font-size:180%" | ◌̢ || style="font-size:180%" | ɖ || style="font-size:180%" | ʯ || style="font-size:180%" | ɭ || style="font-size:180%" | ᶩ || style="font-size:180%" | ɳ || style="font-size:180%" | ᶯ || style="font-size:180%" | Ɽ || style="font-size:180%" | ɽ || style="font-size:180%" | ɻ || style="font-size:180%" | ʵ || style="font-size:180%" | ʂ || style="font-size:180%" | ᶳ
|-
| U+0322 || U+0256 || U+02AF || U+026D || U+1DA9 || U+0273 || U+1DAF || U+2C64 || U+027D || U+027B || U+02B5 || U+0282 || U+1DB3
|-
| Combining Retroflex Hook Below || Latin Small Letter D With Tail || Latin Small Letter Turned H With Fishhook And Tail || Latin Small Letter L With Retroflex Hook || Modifier Letter Small L With Retroflex Hook || Latin Small Letter N With Retroflex Hook || Modifier Letter Small N With Retroflex Hook || Latin Capital Letter R With Tail || Latin Small Letter R With Tail || Latin Small Letter Turned R With Hook || Modifier Letter Small Turned R With Hook || Latin Small Letter S With Hook || Modifier Letter Small S With Hook
|-
| '''Note:''' It is not recommended that this combining diacritic is used with letters to create new characters.[http://en.wikipedia.org/wiki/Hook_(diacritic)#Unicode] || '''Note:''' Upper case of this letter is Latin Capital Letter African D, [[Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet#Bar|Ɖ]] (U+0189). || '''Note:''' Phonetic character used by sinologist to denote [ʐ̩ʷ].[http://en.wikipedia.org/wiki/Syllabic_consonant#Syllabic_fricatives] Not used in any orthography. || '''Note:''' Phonetic character used in [[Wikipedia:International_Phonetic_Alphabet|IPA]]. Not used in any orthography. || '''Note:''' Phonetic character; not used in any orthography. || '''Note:''' Phonetic character used in [[Wikipedia:International_Phonetic_Alphabet|IPA]]. Not used in any orthography. || '''Note:''' Phonetic character; not used in any orthography. || || || '''Note:''' Phonetic character used in [[Wikipedia:International_Phonetic_Alphabet|IPA]]. Not used in any orthography. || '''Note:''' Phonetic character; not used in any orthography. || '''Note:''' Phonetic character used in [[Wikipedia:International_Phonetic_Alphabet|IPA]]. Not used in any orthography. || '''Note:''' Phonetic character; not used in any orthography.
|-
| style="font-size:180%" | Ʈ || style="font-size:180%" | ʈ || style="font-size:180%" | ʐ || style="font-size:180%" | ᶼ
|-
| U+01AE || U+0288 || U+0290 || U+1DBC
|-
| Latin Capital Letter T With Retroflex Hook || Latin Small Letter T With Retroflex Hook || Latin Small Letter Z With Retroflex Hook || Modifier Letter Small Z With Retroflex Hook
|-
| || || '''Note:''' Phonetic character used in [[Wikipedia:International_Phonetic_Alphabet|IPA]]. Not used in any orthography. || '''Note:''' Phonetic character; not used in any orthography.
|}
The retroflex hook is variously also called just a hook, or a tail, in Unicode. They all have in common that it is a hook turning towards the right, attached to the bottom of a letter. The majority of these letters are used in [[Wikipedia:International_Phonetic_Alphabet|IPA]] to represent retroflex consonants.<br>
Note that the retroflex hook is easily confused with the similar looking [[Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet#Palatalized_Hook|Palatalized Hook]] ◌̡. There are also letters with other hooks, such as Ɓɓ, Ƈƈ, Ɗɗ, Ƒƒ, Ɠɠ, [[Wikipedia:Voiced_uvular_implosive|ʛ]], [[Wikipedia:Voiced_glottal_fricative|ɦ]], [[Wikipedia:Sj-sound|ɧ]], Ƙƙ, [[Wikipedia:Labiodental_nasal|ɱ]], Ɲɲ, Ɋɋ, [[Wikipedia:Q_with_hook|ʠ]], Ƥƥ, Ƭƭ, Ʋʋ, Ƴƴ, Ȥȥ.
 
{| class="wikitable"
|+ Uses of Retroflex Hook
! Usage
! Language
! Letters
! Notes
|-
|
|
|
|
|}
 
== Ring Above ==
{| class="wikitable"
|+ Precomposed Letters with Ring Above
| style="font-size:180%" | ˚ || style="font-size:180%" | ◌̊ || style="font-size:180%" | Å || style="font-size:180%" | å || style="font-size:180%" | Ǻ || style="font-size:180%" | ǻ || style="font-size:180%" | Ů || style="font-size:180%" | ů || style="font-size:180%" | ẘ || style="font-size:180%" | ẙ
|-
| U+02DA || U+030A || U+00C5 || U+00E5 || U+01FA || U+01FB || U+016E || U+016F || U+1E98 || U+1E99
|-
| Ring Above || Combining Ring Above || Latin Capital Letter A With Ring Above || Latin Small Letter A With Ring Above || Latin Capital Letter A With Ring Above And Acute || Latin Small Letter A With Ring Above And Acute || Latin Capital Letter U With Ring Above || Latin Small Letter U With Ring Above || Latin Small Letter W With Ring Above || Latin Small Letter Y With Ring Above
|-
| colspan="2" | '''Note:''' May be confused with the Degree Sign ° (U+00B0) || '''Note:''' May be confused with the Ångström Sign Å (U+212B). || || || || || || ||
|}


{| class="wikitable"
{| class="wikitable"
Line 419: Line 137:
| This comes from a diphthong /uo/, where the o was sometimes written as a ring above the u. A sound change then turned /uo/ into /uː/.[http://en.wikipedia.org/wiki/Czech_orthography#Letter_.C5.AE]
| This comes from a diphthong /uo/, where the o was sometimes written as a ring above the u. A sound change then turned /uo/ into /uː/.[http://en.wikipedia.org/wiki/Czech_orthography#Letter_.C5.AE]
|}
|}
After the precomposed characters have been presented, comes examples of how natlangs use the diacritic. (Natromanizations of other scripts are also included.) For each language, the letters and the phonemes they represent are listed. The notes may contain a short history of why the characters are used in this way in the given language. These explanations are very short though, so often times one can read more about it by clicking the reference link ([1], [2] or [3] above). The notes may also give more information about a characters usage, when it is not quite straight forward, or when it differs a little from the other characters in the same group.


== Ring Below ==
==Further reading==
{| class="wikitable"
* [http://www.phon.ucl.ac.uk/home/wells/dia/diacritics-revised.htm J.C. Wells: ''Orthographic diacritics and multilingual computing'']
|+ Precomposed Letters with Ring Below
| style="font-size:180%" | ˳ || style="font-size:180%" | ◌̥ || style="font-size:180%" | Ḁ || style="font-size:180%" | ḁ
|-
| U+02F3 || U+0325 || U+1E00 || U+1E01
|-
| Modifier Letter Low Ring || Combining Ring Below || Latin Capital Letter A With Ring Below || Latin Small Letter A With Ring Below
|}
Note that this diacritic may be confused with [[Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet#Dot_Below|dot below]] ◌̣ or [[Wikipedia:Laminal_consonant|square below]] ◌̻, especially in small font sizes.
 
{| class="wikitable"
|+ Uses of Ring Below
! Usage
! Language
! Letters
! Notes
|-
| Syllabic consonant
| [[Wikipedia:ISO_15919|ISO 15919]] romanization of Indic scripts
| L̥l̥ /l̩/, L̥̄l̥̄ /l̩ː/, R̥r̥ /r̩/, R̥̄r̥̄ /r̩ː/
|
|}
 
== Stroke ==
{| class="wikitable"
|+ Precomposed Letters with Stroke
| style="font-size:180%" | ◌̸ || style="font-size:180%" | ◌̷ || style="font-size:180%" | Ⱥ || style="font-size:180%" | ⱥ || style="font-size:180%" | Ȼ || style="font-size:180%" | ȼ || style="font-size:180%" | Ɇ || style="font-size:180%" | ɇ || style="font-size:180%" | Ł || style="font-size:180%" | ł || style="font-size:180%" | ᴌ || style="font-size:180%" | ƛ || style="font-size:180%" | Ø
|-
| U+0338 || U+0337 || U+023A || U+2C65 || U+023B || U+023C || U+0246 || U+0247 || U+0141 || U+0142 || U+1D0C || U+019B || U+00D8
|-
| Combining Long Solidus Overlay || Combining Short Solidus Overlay || Latin Capital Letter A With Stroke || Latin Small Letter A With Stroke || Latin Capital Letter C With Stroke || Latin Small Letter C With Stroke || Latin Capital Letter E With Stroke || Latin Small Letter E With Stroke || Latin Capital Letter L With Stroke || Latin Small Letter L With Stroke || Latin Letter Small Capital L With Stroke || Latin Small Letter Lambda With Stroke || Latin Capital Letter O With Stroke
|-
| || || || || '''Note:''' May be confused with Cedi Sign, ₵ (U+20B5). || '''Note:''' May be confused with Cent Sign, ¢ (U+00A2). || || ||​ || || '''Note:''' Phonetic character; not used in any orthography. || || '''Note:''' May be confused with Empty Set, ∅ (U+2205).
|-
| style="font-size:180%" | ø || style="font-size:180%" | ᴓ || style="font-size:180%" | Ǿ || style="font-size:180%" | ǿ || style="font-size:180%" | ẜ || style="font-size:180%" | Ⱦ || style="font-size:180%" | ⱦ || style="font-size:180%" | ᵺ
|-
| U+00F8 || U+1D13 || U+01FE || U+01FF || U+1E9C || U+023E || U+2C66 || U+1D7A
|-
| Latin Small Letter O With Stroke || Latin Small Letter Sideways O With Stroke || Latin Capital Letter O With Stroke And Acute || Latin Small Letter O With Stroke And Acute || Latin Small Letter Long S With Diagonal Stroke || Latin Capital Letter T With Diagonal Stroke || Latin Small Letter T With Diagonal Stroke || Latin Small Letter Th With Strikethrough
|-
| '''Note:''' May be confused with Diameter Sign, ⌀ (U+2300). || '''Note:''' Phonetic character; not used in any orthography. || || || || || || '''Note:''' Phonetic character used in some American dictionaries.[http://www.fileformat.info/info/unicode/char/1d7a/index.htm] Not used in any orthography.
|}
This diacritic, and the one consisting of a horizontal [[Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet#Bar|bar]], may both be called stroke in Unicode. In this article they are treated as two separate diacritics. Latin Small Letter Eth, ð, is listed under the [[Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet#Bar|bar]] diacritic. There are several currency symbols and mathematical symbols with strokes, but they are not included here.
 
{| class="wikitable"
|+ Uses of Stroke
! Usage
! Language
! Letters
! Notes
|-
| Letter extension
| [[Wikipedia:Sahaptin_language|Sahaptin]]
| ƛ /t͡ɬ/, ƛ’ /t͡ɬʼ/
| According to Wikipedia, it is "used in transcribing Sahaptin".[http://en.wikipedia.org/wiki/Barred_lambda]
|-
| Other
| [[Wikipedia:Polish language|Polish]]
| Łł /w/
| Historically it stood for /ɫ/.
|}
 
== Tilde ==
{| class="wikitable"
|+ Precomposed Letters with Tilde
| style="font-size:180%" | ~ || style="font-size:180%" | ˜ || style="font-size:180%" |​ ◌̃ || style="font-size:180%" | Ã || style="font-size:180%" | ã || style="font-size:180%" | Ẫ || style="font-size:180%" | ẫ || style="font-size:180%" | Ẵ || style="font-size:180%" | ẵ || style="font-size:180%" | Ẽ || style="font-size:180%" | ẽ || style="font-size:180%" | Ễ || style="font-size:180%" | ễ
|-
| U+007E || U+02DC || U+0303 ||​ U+00C3 || U+00E3 ||​ U+1EAA || U+1EAB || U+1EB4 || U+1EB5 || U+1EBC || U+1EBD || U+1EC4 || U+1EC5
|-
| Tilde || Small Tilde || Combining Tilde || Latin Capital Letter A With Tilde || Latin Small Letter A With Tilde || Latin Capital Letter A With Circumflex And Tilde || Latin Small Letter A With Circumflex And Tilde || Latin Capital Letter A With Breve And Tilde || Latin Small Letter A With Breve And Tilde || Latin Capital Letter E With Tilde || Latin Small Letter E With Tilde || Latin Capital Letter E With Circumflex And Tilde || Latin Small Letter E With Circumflex And Tilde
|-
| '''Note:''' May be confused with swung dash ⁓ (U+2053). || || || || || || || || || || || ||
|-
| style="font-size:180%" | Ĩ || style="font-size:180%" | ĩ || style="font-size:180%" | Ñ || style="font-size:180%" | ñ || style="font-size:180%" | Õ || style="font-size:180%" | õ || style="font-size:180%" | Ȭ || style="font-size:180%" | ȭ || style="font-size:180%" | Ṍ || style="font-size:180%" | ṍ || style="font-size:180%" | Ṏ || style="font-size:180%" | ṏ || style="font-size:180%" | Ỗ
|-
| U+0128 || U+0129 ||​ U+00D1 || U+00F1 || U+00D5 || U+00F5 || U+022C || U+022D || U+1E4C || U+1E4D || U+1E4E || U+1E4F || U+1ED6
|-
| Latin Capital Letter I With Tilde || Latin Small Letter I With Tilde || Latin Capital Letter N With Tilde || Latin Small Letter N With Tilde || Latin Capital Letter O With Tilde || Latin Small Letter O With Tilde || Latin Capital Letter O With Tilde And Macron || Latin Small Letter O With Tilde And Macron || Latin Capital Letter O With Tilde And Acute || Latin Small Letter O With Tilde And Acute || Latin Capital Letter O With Tilde And Diaeresis || Latin Small Letter O With Tilde And Diaeresis || Latin Capital Letter O With Circumflex And Tilde
|-
| style="font-size:180%" | ỗ || style="font-size:180%" | Ỡ || style="font-size:180%" | ỡ || style="font-size:180%" | Ũ || style="font-size:180%" | ũ || style="font-size:180%" | Ṹ || style="font-size:180%" | ṹ || style="font-size:180%" | Ữ || style="font-size:180%" | ữ || style="font-size:180%" | Ṽ || style="font-size:180%" | ṽ || style="font-size:180%" | Ỹ || style="font-size:180%" | ỹ
|-
| U+1ED7 || U+1EE0 || U+1EE1 || U+0168 || U+0169 || U+1E78 || U+1E79 || U+1EEE || U+1EEF || U+1E7C || U+1E7D || U+1EF8 || U+1EF9
|-
| Latin Small Letter O With Circumflex And Tilde || Latin Capital Letter O With Horn And Tilde || Latin Small Letter O With Horn And Tilde || Latin Capital Letter U With Tilde || Latin Small Letter U With Tilde || Latin Capital Letter U With Tilde And Acute || Latin Small Letter U With Tilde And Acute || Latin Capital Letter U With Horn And Tilde || Latin Small Letter U With Horn And Tilde || Latin Capital Letter V With Tilde || Latin Small Letter V With Tilde || Latin Capital Letter Y With Tilde || Latin Small Letter Y With Tilde
|}
 
{| class="wikitable"
|+ Uses of Tilde
! Use
! Language
! Letters
! Notes
|-
| Glottalized vowel
| [[Wikipedia:Vietnamese_language|Vietnamese]]
| Ãã /aˀː˧˥/, Ẵẵ /aˀ˧˥/, Ẫẫ /əˀ˧˥/, Ẽẽ /ɛˀ˧˥/, Ễễ /eˀ˧˥/, Ĩĩ /iˀ˧˥/, Õõ /ɔˀ˧˥/, Ỗỗ /oˀ˧˥/, Ỡỡ /əˀː˧˥/, Ũũ /uˀ˧˥/, Ữữ /ɨˀ˧˥/, Ỹỹ /iˀ˧˥/
| The tilde stands for mid rising tone interrupted by a glottal stop.[http://en.wikipedia.org/wiki/Vietnamese_language#Tones_2] There are many exceptions to the phonemic values of these letters though.[http://en.wikipedia.org/wiki/Vietnamese_orthography#Pronunciation]
|-
| Unrounded vowel
| [[Wikipedia:Estonian_language|Estonian]]
| Õõ /ɤ/
|
|-
| rowspan=2 | Other
| [[Wikipedia:ISO_15919|ISO 15919]] romanization of Indic scripts
| Ññ
| Ññ is used for transcribing the Indic diacritic [[Wikipedia:Anusvara|anusvāra]] before palatal consonants.[http://en.wikipedia.org/wiki/ISO_15919#Comparison_with_UNRSGN_and_IAST]
|-
| [[Wikipedia:Livonian_language|Livonian]]
| Õõ /ɨ/, Ȭȭ /ɨː/
|
|}
 
== Vertical Line Below ==
{| class="wikitable"
|+ Precomposed Letters with Vertical Line Below
| style="font-size:180%; width:"100px" | ˌ || style="font-size:180%; width:"100px" | ◌̩
|-
| U+02CC || U+0329
|-
| Modifier Letter Low Vertical Line || Combining Vertical Line Below
|-
| '''Note:''' Whether this can be said to be the non-combining version of vertical line below is open to debate. This character is used for marking secondary stress in [[IPA]], while combining vertical line below is used for marking syllabic consonants in [[IPA]]. ||
|}
No actual precomposed letters with this diacritic actually exist.
 
{| class="wikitable"
|+ Uses of Vertical Line Below
! Usage
! Language
! Letters
! Notes
|-
| Lowered vowel with retracted tongue root
| [[Wikipedia:Yoruba_language|Yoruba]] (current Nigerian alphabet)
| E̩e̩ /ɛ̙/, O̩o̩ /ɔ̙/
| rowspan=2 | The vertical line below replaced an earlier [[Natlang_Uses_of_Diacritics_in_the_Latin_Alphabet#Dot_Below|dot below]]. This is because the dots get covered when a word is underlined. In Benin, a different alphabet is used for Yoruba.[http://en.wikipedia.org/wiki/Yoruba_language#Writing_system]
|-
| Postalveolar consonant
| [[Wikipedia:Yoruba_language|Yoruba]] (current Nigerian alphabet)
| S̩s̩ /ʃ/
|}


[[Category:Natscripts]]
[[Category:Natscripts]]

Latest revision as of 02:42, 6 July 2021

This is a collection of articles that list different uses of diacritical marks that have natlang precedence. Conlangers can use this to find inspiration for their own conlang's orthography or transliteration. These articles could also be used as reference for those designing a keyboard layout.

Conlangs and transcription systems are also included in these articles. If you want to contribute with your own conlangs, or natlang examples, please read first the design guidelines on the talk page.

List of Articles on Natlang Usage of Latin Alphabet Diacritics
Diacritic name Other names Character Notes
Acute accent Kreska ˊ Although Łł is considered to be an Ll with kreska in Polish typography, this letter is listed under stroke.
Acute accent below ˏ
Bar Stroke, horizontal bar, middle tilde ◌̵ Eth (Ðð) and capital African D (Ɖ) are listed here. See also stroke.
Breve ˘
Breve below ◌̮
Candrabindu Chandrabindu, chandravindu, candravindu, chôndrobindu ◌̐
Caron Háček, haček ˇ
Cedilla ¸ Some of the letters included here have in practice comma below, but Ss and Tt with comma below are listed under comma below.
Circumflex ˆ
Circumflex below ◌̭
Comma above Psilòn pneûma, psilon pneuma, psilí, psili, spīritus lēnis, spiritus lenis ᾿ Actually a greek script diacritic, but used in some Latin alphabets.
Comma above right ◌̕
Comma below ◌̦ This article includes Ss and Tt with comma below, but not other letters containing a comma looking diacritic. Instead, see cedilla.
Descender
Diaeresis/umlaut Tréma, trema ¨
Diaeresis below ◌̤
Dot above Overdot, anusvāra, anusvara ˙
Dot above right ◌͘
Dot below Underdot ◌̣
Double acute accent Hungarumlaut ˝
Double grave accent ​ ◌̏
Double macron below ◌͟◌ This diacritic is very similar to low line.
Double ring below ​ ◌͚
Double vertical line above ​ ◌̎
Grave accent ˋ
Grave accent below ˎ
Hook above Dấu hỏi ◌̉
Horn Dấu móc ◌̛
Inverted breve Arch ◌̑
Low line Underline, underscore ◌̲ This diacritic is very similar to macron below and double macron below.
Macron ˉ
Macron below Line below, low macron ˍ See also double macron below.
Middle dot Interpunct, interpoint, centered dot, centred dot, space dot ·
Ogonek ˛
Palatalized hook Palatal hook ◌̡
Retroflex hook Hook, tail ◌̢
Right half ring ʾ
Ring above ˚
Ring below ˳
Stroke Diagonal stroke, solidus, strikethrough ◌̷ Bar may also be called stroke. Eth (Ðð) is not listed here, but under bar.
Tilde ˜
Tilde below ˷
Tilde overlay ◌̴
Vertical line above ˈ
Vertical line below ˌ
Vertical tilde ◌̾

Layout Overview

An example of a Unicode table, from the article Natlang Uses of Caron. Notice character similarity warnings both in the article text above, and as a note in the table itself.

In these articles combining (non-spacing) diacritics are attached to a ◌. Diacritics without a ◌, like ¨ for example, are non-combining (spacing). Non-combining diacritics are sometimes called modifier letters in Unicode. The non-combining forms may for example be used when writing about a conlang's orthography, when one wants to refer to a diacritic without using any base letter with it. Some natlangs even use some diacritics as stand alone characters!

When a letter is referred to without concerning about case, it is displayed like so: Ťť. This is for clarity's sake because some diacritics may look different depending on the letter's case, as in the previous example. When either only an upper case or a lower case letter is used in an article, it usually refers to that specific case variant. But it can also refer to a character which has only one case.

Sometimes it may be necessary to refer to a digraph, for example Ŀl in Catalan. When a digraph is referenced to without concerning about case, it is written like this: Ŀl ŀl; with a space between the letters. Different languages' orthographies may have different rules about capitalization of the first letter of a word. In most languages, only the first letter of a digraph is capitalized; but there are languages where both letters are capitalized. Which rule a particular orthography, that is examplified in these articles, follows, can thus be discerned from how the article writes the digraph.

These articles show first which precomposed letter plus diacritic combinations exist in Unicode, and what their codepoints and Unicode names are. The different forms of the stand alone diacritics are also shown. For example the tilde has three different forms: An "ASCII form" ~, which is used in programming among other things, where the tilde is centered; a non-combining diacritic form ˜, where the tilde has the same position it would have when combined with a base letter; and a combining form ◌̃.

Many diacritics or accented letters look very similar to other characters, for example caron ˇ and breve ˘. These cases are warned about either in the text at the beginning of the article, or in notes at the table that lists the precomposed characters. It is desirable that all the diacritics in one orthography can be easily told apart, so conlangers devising new orthographies should be careful about this. A conlanger may also mistakenly copypaste a similar looking but wrong character from somewhere to a conlang project, so thereful the articles also list characters that would otherwise be unlikely to normally appear in the same orthography, such as Latin Capital Letter O With Stroke, Ø (U+00D8); and Empty Set, ∅ (U+2205) for example. Cases such as the one with caron ˇ and breve ˘, which concern essentially all characters with this accent, are notified about in the text at the beginning of the article. Cases such as Ø and ∅, which only concern an individual pair of characters, are notified about in the Unicode table.

Uses of Ring Above
Use Language Letters Notes
Back version of front vowel. Often also rounded. Chamorro Åå /ɑ/
Danish, Norwegian Åå /ɔ/ From an earlier digraph aa representing /ɔ/, which in turn came from /aː/.[1]
Swedish Åå /o/ From an earlier digraph aa representing /ɔ/, which in turn came from /aː/.[2]
Long vowel Czech Ůů /uː/ This comes from a diphthong /uo/, where the o was sometimes written as a ring above the u. A sound change then turned /uo/ into /uː/.[3]

After the precomposed characters have been presented, comes examples of how natlangs use the diacritic. (Natromanizations of other scripts are also included.) For each language, the letters and the phonemes they represent are listed. The notes may contain a short history of why the characters are used in this way in the given language. These explanations are very short though, so often times one can read more about it by clicking the reference link ([1], [2] or [3] above). The notes may also give more information about a characters usage, when it is not quite straight forward, or when it differs a little from the other characters in the same group.

Further reading