Skip to content
Surf Wiki
Save to docs
general/persian-alphabets

From Surf Wiki (app.surf) — the open knowledge base

Persian alphabet

Writing system used for the Persian language

Persian alphabet

Summary

Writing system used for the Persian language

FieldValue
typeAbjad
timepresent
altnameالفبای فارسی
fa
sampleکتاب الابنیه عن حقایق الادویه، برگی از دستنویس محفوظ در کتابخانه مجلس در تهران.jpg
image size230px
captionA page from a 12th century manuscript of "Kitab al-Abniya 'an Haqa'iq al-Adwiya" by Abu Mansur Muwaffaq with special Persian letters p (پ), ch (چ) and g (گ = ڭـ).
languagesPersian, Mazanderani, Moghol, Qashqai
fam1Egyptian hieroglyphs
fam2Proto-Sinaitic
fam3Phoenician
fam4Aramaic
fam5Nabataean
fam6Arabic
*Arabi Malayalam<ref>{{Cite webdate2008-03-18title=THE ARABI - MALAYALAM SCRIPTUREurl=http://www.e-malabari.net/culture/arabimalayalam.htmaccess-date=2023-01-11archive-date=18 March 2008archive-url=https://web.archive.org/web/20080318235926/http://www.e-malabari.net/culture/arabimalayalam.htmurl-status=dead}} (indirectly)

fa

  • Adyghe Arabic
  • Arabi Malayalam (indirectly)
  • Arabic Afrikaans
  • Arwi (indirectly)
  • Avar Arabic
  • Azerbaijani Arabic
  • Belarusian Arabic
  • Chagatai
    • Dobrujan Tatar Arabic
    • Karachay-Balkar Arabic
    • Karakalpak Arabic
    • Kazakh Arabic⁹
    • Kumyk Arabic
    • Kyrgyz Arabic
    • İske imlâ
      • Yaña imlâ
    • Uyghur
      • Sarikoli (with some influence from Pashto orthography)
  • Chechen Arabic (indirectly)
  • Gilaki
  • Indo-Persian
    • Aer
    • Kashmiri
    • Marwari Arabic
    • Shahmukhi (Punjabi)
      • Hindko
      • Saraiki
    • Urdu
      • Balochi (indirectly)
      • Balti
      • Bangladeshi Arabic alphabet
      • Brahui
      • Burushaski
      • Gawar-Bati
      • Khowar
      • Kohistani Shina
      • Palula
      • Rohingya Arabic
      • Shina
      • Torwali
      • Wakhi (in Pakistan)
  • Jawi (indirectly)
  • Kabardian Arabic
  • Khorasani Turkic
  • Kurdo-Arabic
  • Lak Arabic
  • Lezgin Arabic
  • Luri
    • Northern Luri
    • Bakhtiari
    • Southern Luri
  • Pashto
    • Munji
    • Ormuri
    • Shughni Arabic
    • Wakhi (in Afghanistan)
  • Sindhi
    • Parkari Koli
  • Ottoman Turkish
    • Arebica
    • Albanian Arabic
    • Crimean Tatar Arabic (could instead be a descendant of Chagatai)
    • Greek Aljamiado
  • Turkmen Arabic
  • Xiao'erjing

The Persian alphabet (), also known as the Perso-Arabic script, is the right-to-left alphabet used for the Persian language. An Arabic-based alphabet, it is largely identical to the Arabic alphabet with four additional letters: پ چ ژ گ (the sounds 'g', 'zh', 'ch', and 'p', respectively), in addition to the obsolete ڤ that was used for the sound . This letter is no longer used in Persian, as the -sound changed to , e.g. archaic زڤان زبان 'language'. Although the sound (ڤ) is written as "و" nowadays in Farsi (Dari-Parsi/New Persian), it is different to the Arabic (و) sound, which uses the same letter.

It was the basis of many Arabic-based scripts used in Central and South Asia. It is used for both Iranian and Dari: standard varieties of Persian; and is one of two official writing systems for the Persian language, alongside the Cyrillic-based Tajik alphabet.

The script is mostly but not exclusively right-to-left; mathematical expressions, numeric dates and numbers bearing units are embedded from left to right. The script is cursive, meaning most letters in a word connect to each other; when they are typed, contemporary word processors automatically join adjacent letter forms. Persian is unusual among Arabic scripts because a zero-width non-joiner is sometimes entered in a word, causing a letter to become disconnected from others in the same word.

History

The Persian alphabet is directly derived and developed from the Arabic alphabet. The Arabic alphabet was introduced to the Persian-speaking world after the Muslim conquest of Persia and the fall of the Sasanian Empire in the 7th century. Following this, the Arabic language became the principal language of government and religious institutions in Persia, which led to the widespread usage of the Arabic script. Classical Persian literature and poetry were affected by this simultaneous usage of Arabic and Persian. A new influx of Arabic vocabulary soon entered the Persian language. In the 8th century, the Tahirid dynasty and Samanid dynasty officially adopted the Arabic script for writing Persian, followed by the Saffarid dynasty in the 9th century, gradually displacing the various Pahlavi scripts used for the Persian language earlier. By the 9th-century, the Perso-Arabic alphabet became the dominant form of writing in Greater Khorasan.

Under the influence of various Persian Empires, many languages in Central and South Asia that adopted the Arabic script use the Persian Alphabet as the basis of their writing systems. Today, extended versions of the Persian alphabet are used to write a wide variety of Indo-Iranian languages, including Kurdish, Balochi, Pashto, Urdu (from Classical Hindustani), Saraiki, Panjabi, Sindhi and Kashmiri. In the past the use of the Persian alphabet was common amongst Turkic languages, but today is relegated to those spoken within Iran, such as Azerbaijani, Turkmen, Qashqai, Chaharmahali and Khalaj. The Uyghur language in western China is the most notable exception to this.

During the Soviet period many languages in Central Asia, including Persian, were reformed by the government. This ultimately resulted in the Cyrillic-based alphabet used in Tajikistan today. See: .

Letters

Example showing the [[Nastaʿlīq]] calligraphic style's proportion rules

Below are the 32 letters of the modern Persian alphabet. Since the script is cursive, the appearance of a letter changes depending on its position: isolated, initial (joined on the left), medial (joined on both sides) and final (joined on the right) of a word. These include 28 letters of the Arabic alphabet, in addition to 4 other letters.

The names of the letters are mostly the ones used in Arabic except for the Persian pronunciation. The only ambiguous name is fa, which is used for both ح and ه. For clarification, they are often called fa (literally "fa-like fa" after fa, the name for the letter ج that uses the same base form) and fa (literally "two-eyed fa", after the contextual middle letterform ـهـ), respectively. There are eight Persian letters that are mainly used in Arabic or foreign loanwords and not in native words: ث, ح, ذ, ص, ض, ط, ظ, ع and غ. These eight letters are also commonly used only in proper names. Unlike Arabic, the Persian language does not have pharyngealization at all. Although the letter غ is mainly used in Arabic loanwords, there are some native Persian words with this letter: آغاز, زغال, etc. The pronunciation of these letters in Persian can differ from their pronunciation in Arabic. For example, the letter ث is pronounced as in Persian, while it is pronounced as in Arabic.

LetterPersianArabic
/s//θ/
/h//ħ/
/z//ð/
ص/s//sˤ/
ض/z//dˤ/
ط/t//tˤ/
ظ/z//ðˤ/
ع/ʔ//ʕ/
غor/ɣ/

Overview table

#Name
(in Persian)Name
(transliterated)TransliterationIPAUnicodeContextual formsFinalMedialInitialIsolated
0همزهfasemGlottal stop[U+0621]()ء
[U+0623]()ـأأ
U+0626ـئـئـئـئ
U+0624ـؤؤ
1الفfasem[U+0627]()ـاا
2بfasemU+0628ـبـبـبـب
3پfasemU+067Eـپـپـپـپ
4تfasemU+062Aـتـتـتـت
5ثfasem / semU+062Bـثـثـثـث
6جیمfasem / semU+062Cـجـجـجـج
7چfasemU+0686ـچـچـچـچ
8حfa (fa)sem / semU+062Dـحـحـحـح
9خfasemU+062Eـخـخـخـخ
10دالfasemU+062Fـدد
11ذالfasem / semU+0630ـذذ
12رfasemU+0631ـرر
13زfasemU+0632ـزز
14ژfasemU+0698ـژژ
15سینfasemU+0633ـسـسـسـس
16شینfasemU+0634ـشـشـشـش
17صادfasem / semU+0635ـصـصـصـص
18ضادfasem / semU+0636ـضـضـضـض
19طاfasem / semU+0637ـطـطـطـط
20ظاfasem / semU+0638ـظـظـظـظ
21عینfasem, /[U+0639]()ـعـعـعـع
22غینfasem,U+063Aـغـغـغـغ
23فfasemU+0641ـفـفـفـف
24قافfasemU+0642ـقـقـقـق
25کافfasemU+06A9ـکـکـکـک
26گافfasemU+06AFـگـگـگـگ
27لامfasemU+0644ـلـلـلـل
28میمfasemU+0645ـمـمـمـم
29نونfasemU+0646ـنـنـنـن
30واوpes (in Farsi)sem / sem / sem / sem, , , (only word-finally)U+0648ـوو
prs (in Dari)sem / sem / sem / sem, , ,
31هfa (fa)sem, or and (word-finally)U+0647ـهـهـهـه
32یfasem / sem / sem / (Also sem / sem in Dari), , ( / in Dari)U+06CCـیـیـیـی

Historically, in Early New Persian, there was a special letter for the sound . This letter is no longer used, as the -sound changed to , e.g. archaic زڤان /zaβān/ زبان 'language'.

Name
(in Persian)Name
(transliterated)TransliterationSoundIsolated formFinal formMedial formInitial form
ڤvesem / sem / semڤـڤـڤـڤـ

Another obsolete variant of the twenty-sixth letter گ is ݣ‎ which used to appear in old manuscripts.

SoundIsolated formFinal formMedial formInitial formName
ݣ‎ـݣ‎ـݣـ‎ڭـfa

Another obsolete variant of the twenty-fifth letter ک is ك‎ which used to appear in old manuscripts.

SoundIsolated formFinal formMedial formInitial formName
ك‎‎ـكـكـ‎كـfa

The archaic letter ݿ was also used as a substitute for the twenty-sixth letter of the Persian alphabet, گ, which was used to appear in the older manuscripts of Persian in the late 18th century to the early 19th century.

SoundIsolated formFinal formMedial formInitial formName
ݿ‎‎ـݿـݿـ‎ݿـfa

Variants

Letter construction

formsisolatedstartmidendi'jam--UnicodeUnicodeUnicodeUnicodeUnicodeUnicodeUnicodeUnicodeUnicodeUnicodeUnicodeUnicode
[ء ]()[ا ]()[ى ]()ں ٮ ح س ص ط [ع ]()ڡ ٯ کل م د ر و ه
ءاٮـٮـٮـ --حـسـصـطـعـڡـٯـ --کـلـمـدروهـ
ءـاـٮــٮــٮـ --ـحــســصــطــعــڡــٯـ --ـکــلــمــدـرـوـهـ
ءـاـىـںـٮـحـسـصـطـعـڡـٯـکـلـمـدـرـوـه
0621 ..0627 ..0649 ..06BA . .066E ..062D ..0633 ..0635 ..0637 ..0639 ..06A1 ..066F ..066F ..0644 ..0645 ..062F ..0631 ..0648. ..0647 ..
1 dot below{{uninastaliq﮳ }} --بج
FBB3.0628 . .062C ..
1 dot above{{uninastaliq﮲ }} --نخضظغفذ
FBB2.0646 ..062E ..0636 ..0638 ..063A ..0641 . .0630 ..0632 ..
2 dots below{{uninastaliq﮵ }} --ی
FBB5.06CC ..
2 dots above{{uninastaliq﮴ }} --تق
FBB4.062A ..0642 ..
3 dots below{{uninastaliq﮹ }} --پچ
FBB9. FBB7.067E ..0686 ..
3 dots above{{uninastaliq﮶ }} --ثش
FBB6.062B ..0634 ..0698 ..
line above{{uninastaliq‾ }} --گ
203E.06AF ..
noneءایںحسصطعکلمدرو
none --0621 ..0627 ..0649 ..06BA . .062D ..0633 ..0635 ..0637 ..0639 ..066F ..0644 ..0645 ..062F ..0631 ..0648. ..
madda above{{uninastaliqۤ }} --ۤآ
06E4. 0653.0622 ..
Hamza below{{uninastaliqٕ }} --ــٕـإ
0655.0625 ..
Hamza above{{uninastaliqٴ }} --ــٔـأئ
0674. 0654.0623 ..0626 ..0624 ..

The i'jam diacritic characters are illustrative only; in most typesetting the combined characters in the middle of the table are used.

Persian has 2 dots below in the initial and middle positions only. The standard Arabic version ي يـ ـيـ ـي always has 2 dots below.

Seven letters (و, ژ, ز, ر, ذ, د, ا) do not connect to the following letter, unlike the rest of the letters of the alphabet. The seven letters have the same form in isolated and initial position and a second form in medial and final position. For example, when the letter ا fa is at the beginning of a word such as اینجا fa ("here"), the same form is used as in an isolated fa. In the case of امروز fa ("today"), the letter ر fa takes the final form and the letter و fa takes the isolated form, but they are in the middle of the word, and ز also has its isolated form, but it occurs at the end of the word.

Diacritics

Persian script has adopted a subset of Arabic diacritics: fa (ar in Arabic), fa (ar in Arabic), and fa or (ar in Arabic, pronounced pes in Western Persian), fa and fa (gemination). Other Arabic diacritics may be seen in Arabic loanwords in Persian.

180
Nastaliq [[Persian Calligram]] the Persian letter [[Mem

Short vowels

Of the four Arabic diacritics, the Persian language has adopted the following three for short vowels. The last one, sukūn, which indicates the lack of a vowel, has not been adopted.

Short vowels
(fully vocalized text)Name
(in Persian)Name
(transliterated)Trans.Value
064E
زبر
(فتحه)fa/fafa
0650
زیر
(کسره)fa/fafa; fa
064F
پیش
(ضمّه)fa/fafa; fa

There is no standard transliteration for Persian. The letters 'i' and 'u' are only ever used as short vowels when transliterating Dari or Tajik Persian. See Persian Phonology

Diacritics differ by dialect, due to Dari having 8 distinct vowels compared to the 6 vowels of Farsi. See Persian Phonology

In Farsi, none of these short vowels may be the initial or final grapheme in an isolated word, although they may appear in the final position as an inflection, when the word is part of a noun group. In a word that starts with a vowel, the first grapheme is a silent fa which carries the short vowel, e.g. اُمید (fa, meaning "hope"). In a word that ends with a vowel, letters ع, ه and و respectively become the proxy letters for fa, fa and fa, e.g. نو (fa, meaning "new") or بسته (fa, meaning "package").

Tanvin (nunation)

Main article: Nunation

Nunation (, tanvin) is the addition of one of three vowel diacritics to a noun or adjective to indicate that the word ends in an alveolar nasal sound without the addition of the letter nun.

Nunation
(fully vocalized text)Name
(in Persian)Name
(transliterated)Notes
064B
تنوین نَصْبْfa
064D
تنوین جَرّfaNever used in the Persian language.
064C
تنوین رَفْعْfa

Tašdid

Main article: Shadda

SymbolName
(in Persian)Name
(transliteration)
0651
تشدیدfa

Other characters

The following are not actual letters but different orthographical shapes for letters, a ligature in the case of the fa. As to ﺀ (hamza), it has only one graphical form since it is never tied to a preceding or following letter. However, it is sometimes 'seated' on a fa, fa or fa, and in that case, the seat behaves like an ordinary fa, fa or fa respectively. Technically, hamza is not a letter but a diacritic.

NamePronunciationIPAUnicodeFinalMedialInitialStand-aloneNotes
semsemU+0622ـآآThe final form is very rare and is freely replaced with ordinary alef.
semsem or semU+06C0ـۀۀValidity of this form depends on region and dialect. Some may use the two-letter ـه‌ی or ه‌ی combinations instead.
semsemU+0644 (lām) and U+0627 (alef)ـلالا
semU+0640[ـ]()This is the medial character which connects other characters

Although at first glance, they may seem similar, there are many differences in the way the different languages use the alphabets. For example, similar words are written differently in Persian and Arabic, as they are used differently.

Unicode has accepted in the Miscellaneous Symbols range. In Unicode 1.0 this symbol was known as . It is a stylization of الله (fa) used as the emblem of Iran. It is also a part of the flag of Iran.

The Unicode Standard has a compatibility character defined that can represent ریال, the Persian name of the currency of Iran.

Novel letters

The Persian alphabet has four extra letters that are not in the Arabic alphabet: , (ch in chair), (s in measure), . An additional fifth letter ڤ was used for (v in Spanish huevo) but it is no longer used.

SoundShapeNameUnicode code point
پfaU+067E
(ch)چfaU+0686
(zh)ژfaU+0698
گfaU+06AF

Deviations from the Arabic script

Persian numerals Persian uses the Eastern Arabic numerals, but the shapes of the digits 'four' (۴), 'five' (۵), and 'six' (۶) are different from the shapes used in Arabic. All the digits also have different codepoints in Unicode:

Hindu-ArabicPersianNameUnicodeArabicUnicode
0۰صفرU+06F0٠U+0660
1۱يکU+06F1١U+0661
2۲دوU+06F2٢U+0662
3۳سهU+06F3٣U+0663
4۴چهارU+06F4٤U+0664
5۵پنجU+06F5٥U+0665
6۶ششU+06F6٦U+0666
7۷هفتU+06F7٧U+0667
8۸هشتU+06F8٨U+0668
9۹نهU+06F9٩U+0669
-یyeU+06CCيU+064A
کkâfU+06A9كU+0643

Comparison of different numerals

Abjad numeralsابجدهوزحطي

Word boundaries

Typically, words are separated from each other by a space. Certain morphemes (such as the plural ending '-hâ'), however, are written without a space. On a computer, they are separated from the word using the zero-width non-joiner.

Cyrillic Persian alphabet in Tajikistan

As part of the russification of Central Asia, the Cyrillic script was introduced in the late 1930s. The alphabet has remained Cyrillic since then. In 1989, with the growth in Tajik nationalism, a law was enacted declaring Tajik the state language. In addition, the law officially equated Tajik with Persian, placing the word Farsi (the endonym for the Persian language) after Tajik. The law also called for a gradual reintroduction of the Perso-Arabic alphabet.

The Persian alphabet was introduced into education and public life, although the banning of the Islamic Renaissance Party in 1993 slowed adoption. In 1999, the word Farsi was removed from the state-language law, reverting the name to simply Tajik. the de facto standard in use is the Tajik Cyrillic alphabet, and only a very small part of the population can read the Persian alphabet.

References

References

  1. (2008-03-18). "THE ARABI - MALAYALAM SCRIPTURE".
  2. "PERSIAN LANGUAGE i. Early New Persian". [[Iranica Online]].
  3. Orsatti, Paola. (2019). "Persian Language in Arabic Script: The Formation of the Orthographic Standard and the Different Graphic Traditions of Iran in the First Centuries of the Islamic Era". Creating Standards (Book).
  4. Lapidus, Ira M.. (2012). "Islamic Societies to the Nineteenth Century: A Global History". [[Cambridge University Press]].
  5. Lapidus, Ira M.. (2002). "A History of Islamic Societies". [[Cambridge University Press]].
  6. Ager, Simon. "Persian (Fārsī / فارسی)".
  7. "ویژگى‌هاى خطّ فارسى". [[Academy of Persian Language and Literature]].
  8. "??". Persianacademy.ir.
  9. "PERSIAN LANGUAGE i. Early New Persian". Iranica Online.
  10. [http://www.unicode.org/charts/PDF/U2600.pdf#page=4 "Miscellaneous Symbols"]. p. 4. ''The Unicode Standard, Version 13.0''. Unicode.org
  11. [https://www.unicode.org/versions/Unicode1.0.0/CodeCharts2.pdf#page=155 "3.8 Block-by-block Charts"] § Miscellaneous Dingbats p. 325 (155 electronically). ''The Unicode Standard Version 1.0''. Unicode.org
  12. Pournader, Roozbeh. (2001-09-20). ["Proposal to add Arabic Currency Sign Rial to the UCS"](http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2373.pdf}} It proposes the character under the name of {{sc2).
  13. "Unicode Characters in the 'Number, Decimal Digit' Category".
  14. Hämmerle, Christa. (2008). "Gender Politics in Central Asia: Historical Perspectives and Current Living Conditions of Women". Böhlau Verlag Köln Weimar.
  15. Cavendish, Marshall. (September 2006). "World and Its Peoples". Marshall Cavendish.
  16. (2001). "Politics of Language in the Ex-Soviet Muslim States: Azerbayjan, Uzbekistan, Kazakhstan, Kyrgyzstan, Turkmenistan, and Tajikistan". University of Michigan Press.
  17. Buyers, Lydia M.. (2003). "Central Asia in Focus: Political and Economic Issues". Nova Publishers.
  18. Ehteshami, Anoushiravan. (1994). "From the Gulf to Central Asia: Players in the New Great Game". University of Exeter Press.
  19. Malik, Hafeez. (1996). "Central Asia: Its Strategic Importance and Future Prospects". St. Martin's Press.
  20. (1994). "The New Geopolitics of Central Asia and Its Borderlands". Indiana University Press.
  21. (1999). "Islam Outside the Arab World". St. Martin's Press.
  22. (1995). "Oil in the New World Order". University Press of Florida.
  23. Badan, Phool. (2001). "Dynamics of Political Development in Central Asia". Lancers' Books.
  24. Winrow, Gareth M.. (1995). "Turkey in Post-Soviet Central Asia". Royal Institute of International Affairs.
  25. Parsons, Anthony. (1993). "Central Asia, the Last Decolonization". David Davies Memorial Institute.
  26. (1990). "Report on the USSR.". RFE/RL, Incorporated.
  27. (1990). "Middle East Monitor". Middle East Institute.
  28. (2010-01-06). "The Middle East: A History". McGraw-Hill Education.
  29. (2009). "Worldmark Encyclopedia of Cultures and Daily Life". Gale.
Wikipedia Source

This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page.

Want to explore this topic further?

Ask Mako anything about Persian alphabet — get instant answers, deeper analysis, and related topics.

Research with Mako

Free with your Surf account

Content sourced from Wikipedia, available under CC BY-SA 4.0.

This content may have been generated or modified by AI. CloudSurf Software LLC is not responsible for the accuracy, completeness, or reliability of AI-generated content. Always verify important information from primary sources.

Report