Unicode discussion
Advertisement

Microsoft's Kartika and other Malayalam Unicode compliant fonts(Anjali, Rachana) behave differently for following NA, RRA combinations. This introduces error in the texts being produced currently.

Image Description Correct(?) Encoding After chillus approved (in Unicode 5.1)
Nta Chillu NA stacked over subscript RRA <NA, VIRAMA, ZWJ, ZWJ, VIRAMA, RRA> or
<NA, VIRAMA, ZWJ, VIRAMA, RRA>
<NA, ZWJ, VIRAMA, RRA> or
<NA, VIRAMA, RRA>
N Ra Chillu NA followed by full RRA <NA, VIRAMA, ZWJ, RRA> or

<NA, VIRAMA, ZWJ, ZWNJ, RRA>
(similar to the current encoding of
N sa as <NA, VIRAMA, ZWJ, SA>)

<Chillu-N, RRA>
N~Ra NA, Visible virama, full RRA <NA, VIRAMA, RRA> or
<NA, VIRAMA, ZWNJ, RRA>
no changes

Current behavior of the fonts[]

Karthika produces Nta from <NA, VIRAMA, ZWJ, RRA>. Anjali and Rachana produces Nta from both <NA, VIRAMA, ZWJ, RRA> and <NA, VIRAMA, RRA>. None of them produce Nta from <NA, VIRAMA, ZWJ, ZWJ, VIRAMA, RRA>.

Advertisement