Professional Documents
Culture Documents
> C X
> C X
> - C O
> - C X
t t t >t t t t -t t t t t t Ct t t Ot
t t t >t t t t -t t t t t t Ct X t X
> X X - X C X X X
t t t >t X t t -t t t t X t Ct X X X
27
6.3.2.2 Consonant and Mtr +Nasal combinations.
This set refers to a Consonant and Mtr + Nasal marker combinations.
Consonant and Mtr + Nasal combinations - Set 1
3 - " O ^
3 - X O ^ X
t 3t t -t t X t Ot t X
l l3 l l- X X l X l^ l X
l 3l l -l l X l Ol l X
X X X X X X O X X
3 - X O ^ X
3 - X O ^ X
3 - X O ^ X
3 X X X X X X X X
t 3 - X O ^ X
t 3 - " O ^ X
3 - " O ^ X
t 3 - " O ^
28
Consonant and Mtr +Nasal combinations - Set 2
This set is in continuation of set 1 above which shows combinations of Consonant and
Mtr + Nasal marker
8 ' 8 Q d
8 ' 8 Q d
t 8t t 't 8t Qt dt t t t t
l l8 l l' l8 lQ ld X l l l
l 8l l 'l 8l Ql dl l l l l
X X X X X X X X X X
8 ' 8 d
8 ' 8 d
8 ' 8 Q d
8 ' 8 Q d
t 8 ' 8 X d
t 8 ' 8 Q d
8 ' 8 Q d
t 8 ' 8 Q d
Consonant and Mtr +Nasal combinations - Set 3
This set is in continuation of set 2 above which shows combinations of Consonant and
Mtr + Nasal marker
> - C
7
O
8
> - C X X
t t t >t t t t -t t t t t t Ct t t Ot
l l l l> l l X l- l l l l l lC l X X
7
Inserted by expert although this is not a single consonant but a ligature
8
Inserted by expert although this is not a single consonant but a ligature
29
l l l >l l l X -l l l l l l Cl l X X
X X X X X X X X X X X
> C X X
> X C X X
> X - C X X
> X X - C X X
t > X - C X X
t > X - C X X
> - C X X
t > - C X X
Consonant and Mtr + Nasal combinations: With Chandrabindu
Since Chandrabindu is rarely used in Gujarati, the experts have deemed the same as
invalid
30
6.3.3. The Ligature Set of Gujarati.
Gujarati has a large set of ligatural forms. These are combinations of
Consonant+Halanta+Consonant (CHC) or CHCHC or even rarer CHCHCHC. The CHC
combinations which are the most frequent are arranged in the shape of a matrix: the
abscissa or horizontal axis refers to the Consonant which constitutes the ligature and the
ordinate or vertical axis shows the consonant which forms the ligature and which is
followed by a halanta.
As in 6.3.2. the ligature sets are divided into the following
6.3.3.1 CHC (in a matrix)
6.3.3.2 CHCHC
6.3.3.3.CHCHCHC
6.3.3.1. CHC ( combination of two consonanats)
These ligatures are presented as in the earlier case of Consonant+Mtr combinations in
three sets. A lot of slots have an X marked, showing that the experts have deemed that
such a ligature is not possible in the language. However in these cases, the font
developer is to assume that the ligature is linear in nature.
The following set shows a combination of two consonants. To know how particular
combinations forms, select one consonant from the first column and second from first
row. For eg. Combination of consonant 3 and 3 is ligature .
CHC( combination of two consonants) - Set 1
3 - " O ^
g 3 X X X 3 3O X 3 X
X X X X X X ^ X X
_ X X -- - X X X -^ - X
q X X X X X X X X X
X X X X X X X X X X
_ X X X X X O X X X
X X X X X X X X X
_ X X X X X X X ^ X
( X X X X X X X X X
X X X X X X X X X X
3 X X X X X X X X
31
X X X X X X X X X X
{ X X {- X X X X {^ { X
q X X X X X X X X X X
_ X X X X X X X X X X
q 3 X X X X X X X
q X X X X X X X X X X
q X X q - q X X X X X X
q X X X X X X X X X X
_ 3 X - X X X ^ X
} 3 X X X X X X X X
g X X X X X X X ^ X X
X X X X X X X >^ > X
X X X X X X X X X X
X X X X X X X X X X
q X X X X X X X X X X
3 X - X X X ^ X X
Q X X X X X X X X X X
_ X X X X X X X -^ - X
-3 - X X X - X X X X
j 3 - X X X X X X X
-3 - X X X - X -^ X X
X X X X X X X X X X
32
CHC Set 2:
The following set shows a combination of two consonants. To know how particular
combinations forms, select one consonant from the first column and second from first
row. For eg. Combination of consonant g and 8 is ligature 38.
CHC( combination of two consonants) - Set 2
8 ' 8 Q d
g 38 X X X X 3d 3 X X 3
X X X X X d X X X X
_ X X X X -Q X X - - -
q X X X X X X X X X
X X X X X X X X X X
_ X X X X X -d X X X X
X X X X X X X X X X
_ X X X X X X X X X
( X X ' X X X X X X
X X X X X X X X X X
' X X X X X X X
X X X X X X X X X
{ X X { 8 X X X { X X
q X X X X X X X X X
_ X X X X X X X X X X
q X X X X X X X
q X X X X X X X X X
q X X X X X X X q q X
q X X X X X X X X - -
_ 8 X ' X X d
} 8 X X X X d X X X
33
g 8 X X X X d X X X
X X >' X X X X > > >
X X X X X X X X X X
X X X X Q d X X X
q X X X X X X X X X X
8 X ' X X d X
Q X X X X X X X X X X
_ X X -' X X X X X X X
X X X X X -d X X X -
j X X Q X X X X X
-8 X -' X X -d - - X -
X X X X X X X X X c
CHC SET 3:
The following set shows a combination of two consonants. To know how particular
combinations forms, select one consonant from the first column and second from first
row. For eg. Combination of consonant g and is the ligature .
CHC( combination of two consonants) - Set 3
> - C
g X X 3 3 z 3 3 3 X 3 X
X X X X X X X
_ - X -> - - - - - X X - X
q X X X X X X X X X X
X X X X { X X X X X X X X
_ X X X X X > X X X X X X
X X X X X O X X X X X
_ X X X X X X X X X
34
( X X > X X X X X X X
X X X X X X X X X X X X X
X X X X X X
X X X X X X X X X X X X
{ X X X { { X { X X { {C
q X X X X X ) X X X X X X
_ X X X _ _ _ X X X X X X X
q > X ? X X
q X X X X X X X X X X X
q q X q> q _ X X X X qC
q X X X X - - > X - X X X X
_ X X X X X C
} X X X X X C
g X X X X X g X X X X X
X X >> > X > > > X X > >C
X X X X X X X X X
> X X C
q X X X X X - X X X X X X X
> X X C
Q X X X X X ! X X X X X X X
_ X X X X X - X - X X X -C
- - -> - - - - X X -C
j X X X X X X X
- - -> X - - - - X X - X
X X X X /
X X cC
35
6.3.3.2 CHCHC ( combination of three consonanats)
These are not as frequent as the CHC combinations. Only the major are listed below.
With a few exceptions these are mainly linear in nature
Q O
z
- - q- _
O O _
_ - - -
z -z
- -
6.3.3.3.CHCHCHC ( Combination of four Consonanats)
This cluster is rare in a majority of languages and the experts have deemed that it is not
found in Gujarati
6.3.4 The Collation Order of Gujarati.
Collation is one of the most important features of a script grammar. It determines the
order in which a given culture indexes its characters. This is best seen in a dictionary sort
where for easy search words are sorted and arranged in a specific order. Within a given
script, each allo-script may have a different sort-order. Thus in Devanagari the conjunct
glyph is sorted along with , since the first letter of that conjunct is and on a similar
principle is sorted along with . Different scripts admit different sort orders and for all
high-end NLP applications, sort is a crucial feature to ensure that the applications index
data as per the cultural perception of that community. In quite a few States, sort order is
clearly defined by the statutory bodies of that state and hence it is crucial that such sort
order be ascertained and introduced in the script grammar.
In the case of Gujarati the following is the traditional sort order as determined by the
experts. The order as given below is pertinent to sorting by a computer program and is
compliant with CLDR as laid down by Unicode and W3C.
36
- - U 3
- - - - - - 3 - "
O ^ 8 ' 8 Q
d >
- C
t l l t t t
In Tabular format:
- - U 3
- - - - - - 3 - "
O ^ 8 ' 8 Q
d >
- C
t l l t t t
1 are used only for Sanskrit Loans
37
7. REFERENCES
1. http://www.unicode.org
2. ISCII91
38
8. ANNEXURES
Annexure 1: Names of experts who have contributed to the script grammar
39
Annexure 2: Unicode Table of Gujarati
9
9
The Unicode chart provided is for version 5.1 since the Script Grammar was prepared at that time. No
considerable change in the script grammar can be seen in the updated versions of Unicode, with the
possible addition of the Rupee Sign U+02B9
40