Conlang Recognition Chart

This article describes a variety of simple clues that can be used to determine, with high accuracy, which conlang a document is written in.

Ayeri

Orthography: ptkbdgmnvshrly aāeēiīoōuū (older transcriptions use c for k)

Non-ASCII: āēīōū (ạẹịọụ ạ̄ẹ̄ị̄ọ̄ụ̄ have also been used in older transcriptions)
Unused ASCII: cfjqwxz

Diphthongs: au, ay, ey, oy (also uses 'iy' frequently, but that is not a diphthong)
Common words: ang, sira, eng, le, si, yam, ya
Common morphemes: -ang, -aris, -reng, -ley, -yam, -ea, -iya, -ara, -in, -on
Other common features: Words can get quite long because the language is agglutinative.
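The letter inventory above lends itself to a quick mechanical check. The following is a minimal sketch, assuming the orthography line lists every letter in use; the function name non_ayeri_letters is made up for illustration, and a hit only suggests that the text is not Ayeri (or uses an older transcription).

```python
import unicodedata

# Letters taken from the Ayeri orthography line of this chart.
AYERI_LETTERS = set("ptkbdgmnvshrly" "aāeēiīoōuū")

def non_ayeri_letters(text):
    """Return the set of letters in `text` that are outside the Ayeri orthography."""
    # NFC turns "a" + combining macron into the single character "ā".
    normalized = unicodedata.normalize("NFC", text).lower()
    return {ch for ch in normalized if ch.isalpha() and ch not in AYERI_LETTERS}
```

An empty result means every letter fits the orthography; it does not prove the text is Ayeri.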

Calénnawn

Non-ASCII: áéíóú àèìòù ë ðñ and $ or š
Unused ASCII: jk
Diphthongs: aw, iw, ow, ay, ey, oy, uy
Digraph: ii
x and q are common
Words starting with f- or s- (like f-qúba)
Words of more than one syllable contain at least one acute accent
Common one- and two-letter words: a, e, i, h, o, on, so, se, fh, el, en, iw, fa

Ebisédian

ASCII orthography:

Uses w, y, 3 and 0 as vowel letters
Upper- and lowercase consonants are distinct (e.g., K vs. k)
Use of double vowel letters to indicate length: 00, ww.
Use of apostrophe after vowels to indicate stress: 00', yy'.
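The doubled-vowel and stress-apostrophe clues can be searched for with a regular expression. This is only a sketch: the pattern covers the four vowel letters named above (33 is an extrapolation, since the chart's examples show only 00, ww and yy), the function name is hypothetical, and digit runs in ordinary text will produce false hits.

```python
import re

# A doubled 0, 3, w or y, optionally followed by the stress apostrophe.
LONG_VOWEL = re.compile(r"(00|33|ww|yy)('?)")

def ebisedian_ascii_hits(text):
    """Return (long vowel count, how many of those carry a stress apostrophe)."""
    matches = LONG_VOWEL.findall(text)
    stressed = sum(1 for _, mark in matches if mark)
    return len(matches), stressed
```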

LaTeX orthography:

Use of ø and ɜ as vowel letters
Multiple diacritics over single vowel letters, up to 4 (macron, acute, tear-drop accent, subscript tilde).
Subscript tilde to indicate nasality.
Tear-drop accent in vowel-initial words (looks like a superscript opening left single quote)

General:

Common single-word sentences with i in the last syllable.
Common words: Ke, ve, ke, je, re (always clause-final), keve, tømø, tɜmɜ, timi, tama, tumu.

Germanech

Non-ASCII: Ää Öö Üü Éé
Digraphs: ch cj dj gj tj tz
Common words: a de ez ést héz il la las los

Minza

Non-ASCII: ċ č ł ŋ ö ř š ż ž
Unused ASCII: q w x
Digraphs: ch, gh
Combinations: ië, yö, uö, öy, -h after vowels, łř, nř
Common words: ai, ba, ċi, die, en, fi, ida, ja, kam, keh, ła, łu, min, nu, öych, ři, šei, šö, vö, yn, zmi

Qþyn|gài

Non-ASCII: Þþ|ǂáíúýàìùỳ
Unused ASCII: bpmfvweoczj
Combinations: nq qþ rq ql tl hh nǂg n!g n||g ǂk ái áu úi íu ài àu ùi ìu
All words start with a consonant and end with a vowel
Very long words
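The word-shape rule and the unused-letter list can be combined into a per-word test. This sketch assumes that the vowels are a, i, u, y and their accented forms (inferred from the letter lists above, not stated outright) and that words are separated by whitespace; fits_word_shape is a made-up name.

```python
import unicodedata

UNUSED = set("bpmfvweoczj")
# Assumption: vowels are a, i, u, y plus the accented forms listed in the chart.
VOWELS = set("aiuyáíúýàìùỳ")

def fits_word_shape(word):
    """True if `word` avoids the unused letters, starts with a non-vowel and ends with a vowel."""
    w = unicodedata.normalize("NFC", word).lower()
    if not w or any(ch in UNUSED for ch in w):
        return False
    return w[0] not in VOWELS and w[-1] in VOWELS
```

The language's own name passes this test, while an English word such as "banana" fails on the letter b.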

Regimonti

Regimonti is a Romance language with vocabulary based on classical Latin rather than Vulgar Latin.

Its name is "Rumanşa" in Regimonti
Latin alphabet with three additional characters, è, ņ and ş, which represent the sounds /E/, /J/ and /S/ respectively.
Diphthongs: ai, au, oi, ua
Common words: unu, una, lu, la. First person singular pronoun: O

Sasxsek

7-bit ASCII characters only.
All upper case or all lower case letters, no mixed case.
Unused punctuation symbols: ; " ? !
Unused letters: C, Y.
No doubled letters.
Epenthetic X (= /@/) used in compounds.
Single bracket quotes: < >
Apostrophe to break up numbers or long words to make them more readable: 1'000'000
Colon used for abbreviations: k:m: (=kilxmitros)
Proper name marker "li".
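Several of the Sasxsek clues are purely mechanical, so they can be checked in a few lines. The sketch below covers only the character-level rules (ASCII only, single case, unused punctuation, unused letters, no doubled letters); the function name and the wording of the messages are illustrative assumptions.

```python
import re

def sasxsek_violations(text):
    """Return the list of character-level rules from the chart that `text` breaks."""
    problems = []
    if not text.isascii():
        problems.append("non-ASCII character")
    if text != text.lower() and text != text.upper():
        problems.append("mixed case")
    if re.search(r'[;"?!]', text):
        problems.append("unused punctuation")
    if re.search(r"[cy]", text, re.IGNORECASE):
        problems.append("unused letter")
    # Only letters count here, so digit groups like 1'000'000 are allowed.
    if re.search(r"([a-z])\1", text, re.IGNORECASE):
        problems.append("doubled letter")
    return problems
```

An empty list means the text is compatible with these rules, which is weak evidence on its own for very short inputs.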

http://www.nutter.net/sasxsek

Senjecas

Latin script

Non-ASCII consonants: ɱ; þ; ð; ł; ß; к; ħ; ʒ (yogh)
- Until yogh is made available on Wiki, I am using ʒ ezh.
- к = Kalaallisut (Greenlandic) kra
- On the conlang list: ħ = jh, к = k, yogh = j, ł = lh, ɱ = mh, r = rh, ß = dz
Breve under or over to indicate labialization: ğ, ð̬
- On the conlang list labialization is indicated by ü
Cedilla under or apostrophe over to indicate palatalization: ç, g̓
- On the conlang list palatalization is indicated by ï
Non-ASCII vowel: ø; all vowels with acute accent: í, é, á, ǿ, ó, ú; all vowels with double acute accent: i̋, e̋, a̋, ø̋, ő, ű
- On the conlang list the double acute accent is replaced with a circumflex
Non-ASCII weak vowels: ı, ɶ, æ
- On the conlang list ı = ï

Cyrillic script

Consonants: п/б - ф/в - м̀/м; т/д - o̴/ђ - љ/л; ц/s - c/з - p/н; к/г - x/ғ - ы/j
- o̴ is the old Cyrillic letter fita, an "o" with a tilde through it, derived from θ.[1]
- On the conlang list θ will be used.
Breve under or over to indicate labialization: г̆, б̬
Vowels: и, е, а, ѫ, o, у; with acute accent: и́, é, á, ѫ́, ó, ý; with double acute accent: и̋, e̋, a̋, ѫ̋, ő, y̋
- On the conlang list the double acute accent is replaced with a circumflex.
- On the conlang list ø will be used for ѫ yus.
Palatalization is indicated with the iotated vowels: i, є, я, ё, ю; palatalization does not occur before ѫ. This is not a rule; it just so happens.
Weak vowels: ь, ў, ъ

Greek script

Consonants: π/? - φ/β - μ̀/μ; τ/δ - θ/? - ?/λ; ?/? - σ/ζ - ρ/ν; к/γ - χ/? - ?/?
Vowels: ι, η, α, o, ω, υ; with acute accent: ί, ή, ά, ό, ώ, ύ; with double acute accent: ι̋, η̋, α̋, ο̋, ω̋, υ̋
- On the conlang list the double acute accent is replaced with a circumflex
Weak vowels: ϊ, ϋ, ε

y α β γ δ ε ζ η θ ι κ λ μ ν ξ ο π ρ ς σ τ υ φ χ ψ ω ϊ ϋ ό ύ ώ

ϐ ϑ h ϙ ϡ

Other Scripts

The Armenian, Devanagari, Georgian, Hebrew and Tengwar alphabets have also been adapted for Senjecas.

Tatari Faran

Uses a subset of the Latin alphabet: a, b, d, e, f, h, i, j, k, m, n, o, p, r, s, t, u.
Unused letters: c, g, l, q, v, w, x, y, z.
No capitalization, not even in proper names.
Glottal stop in words, indicated by apostrophe (').
ts used as a digraph.
d is always word-initial, and r is always medial.
The only consonant clusters are double consonants beginning with m or n.
Common words: ka, kei, ko, sa, sei, so, na, nei, no, ei (never at the beginning of a sentence); e (never at the end of a sentence); da (always follows a word ending in -n).
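The alphabet, capitalization and letter-position rules above can be tested word by word. This is a rough sketch under simple assumptions: words are separated by whitespace, common punctuation is stripped from word edges, and the consonant-cluster and common-word clues are not checked. The function name is hypothetical.

```python
# Letters from the chart, plus the apostrophe used for the glottal stop.
ALLOWED = set("abdefhijkmnoprstu'")

def tatari_faran_violations(text):
    """Return the set of rules from the chart that `text` breaks."""
    problems = set()
    if text != text.lower():
        problems.add("capital letter")
    for raw in text.lower().split():
        word = raw.strip(".,;:!?\"()")
        if not word:
            continue
        if any(ch not in ALLOWED for ch in word):
            problems.add("letter outside the alphabet")
        if "d" in word[1:]:
            problems.add("d not word-initial")
        if word[0] == "r" or word[-1] == "r":
            problems.add("r not medial")
    return problems
```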

Terzemian

Latin script

Non-ASCII: Åå Čč Ǧǧ Ňň Öö Šš Üü Žž
Unused ASCII: Jj Qq (except in foreign names)
Vowel Harmony groups:
- Aa Ee Ii Öö Üü
- Åå Oo Öö Uu Üü
- Aa Åå Ee Oo Öö
Sentences generally start with a word (the verb) beginning with a multi-consonant cluster
Verb may have a, e, or ö prefixed to the initial cluster
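The harmony groups can be used as a word-level test. The sketch below assumes that all vowels of a native word come from a single group, which is one reading of the list above rather than something the chart states; foreign names would be expected to fail. The function name is made up.

```python
import unicodedata

# The three vowel harmony groups of the Latin orthography, in lowercase.
HARMONY_GROUPS = [set("aeiöü"), set("åoöuü"), set("aåeoö")]
ALL_VOWELS = set().union(*HARMONY_GROUPS)

def is_harmonic(word):
    """True if every vowel in `word` belongs to one and the same harmony group."""
    w = unicodedata.normalize("NFC", word).lower()
    vowels = {ch for ch in w if ch in ALL_VOWELS}
    return any(vowels <= group for group in HARMONY_GROUPS)
```

A word mixing a and u, for instance, fits none of the three groups and is rejected.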

Cyrillic script

Non-Russian: Ғғ Ңң Өө Ўў Үү Һһ Ωω
Unused Russian: Ее Щщ Ъъ Ыы Ьь Юю Яя (Ыы sometimes used for non-harmonic or non-Terzemian vowels in foreign words)
Foreign names not originally written in Cyrillic may occur in Latin orthography
Vowel Harmony groups:
- Аа Ээ Ии Өө Үү
- Ωω Оо Өө Уу Үү
- Аа Ωω Ээ Оо Өө
Sentences generally start with a word (the verb) beginning with a multi-consonant cluster
Verb may have а, э, or ө prefixed to the initial cluster

Arabic script

Not in Standard Arabic: اً څ چ اِ گ ڽ اَ اَِ هَِ ۆ ژ
Standard but not used: ة ث ج ح ذ ص ض ط ظ ع ق ى
Vowel Harmony groups:
- ا اِ هِ اَِ هَِ
- اً اَ هَ هَِ
- ا اً اِ اَ اَِ
Sentences generally start with a word (the verb) beginning with a multi-consonant cluster
Verb may have اِِ ,ا, or اَِ prefixed to the initial cluster

Þrjótrunn

Non-ASCII: ÁÐÉÍÓÚÝÞÆÖáðéíóúýþæö
Unused ASCII: cqz
Combinations: pp tt kk gj ggj kj kkj
Frequent words: ún únn á í eð er þiss þissi þissa
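The common-word lists given throughout this chart can also be counted automatically. The following sketch copies the lists for three of the languages (Ayeri, Germanech and Þrjótrunn) and tallies hits per language; the function name and the choice of languages are illustrative, and short words shared between languages mean the counts are only a hint.

```python
import re
import unicodedata
from collections import Counter

# Word lists copied from the corresponding sections of this chart.
COMMON_WORDS = {
    "Ayeri": {"ang", "sira", "eng", "le", "si", "yam", "ya"},
    "Germanech": {"a", "de", "ez", "ést", "héz", "il", "la", "las", "los"},
    "Þrjótrunn": {"ún", "únn", "á", "í", "eð", "er", "þiss", "þissi", "þissa"},
}

def common_word_hits(text):
    """Count, per language, how many tokens of `text` are listed common words."""
    normalized = unicodedata.normalize("NFC", text).lower()
    hits = Counter()
    for token in re.findall(r"\w+", normalized):
        for language, words in COMMON_WORDS.items():
            if token in words:
                hits[language] += 1
    return hits
```

Calling common_word_hits(text).most_common(1) gives the language with the most hits, if there are any.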
