Arabic letter frequency

No language has an exact letter frequency distribution, as all writers write slightly differently.

If these 8 modified forms are folded into the primary list based on shape or phonetic similarity, the outcome then is as shown in Table 2.

The ordering of the alphabet shown in the tables is more logical[citation needed] than is used by the Unicode standard.

Although the full set of Arabic characters includes about ten diacritics as shown in the Figure 1, frequency analysis of Arabic characters is only concerned with computing the frequency of alphabet letters shown in Table 2.

The following Arabic sources are used to generate an acceptable amount of data on which frequency statistics are conducted.

Figure 1: Arabic characters that can be produced using the Arabic Letter Keyboard Intellark .
Table 1: The Arabic alphabet. Letters 1 to 28 are the primary letters. Letters 29 to 36 are the modified letters.
Table 2: The Arabic alphabet, with modified letters lumped onto their primary forms.
Letter frequency distribution for the counted letters: Histogram data sorted on frequency.