[Ilugc] UTF-8 and indic languages
ramanraj.k at gmail.com
Sun Nov 19 08:18:04 IST 2006
Prashanth Mohan wrote:
> Ramanraj K wrote:
>> Tamil and other indic scripts are 3 bytes per character if encoded
>> with UTF-8. That appears to be expensive. Unicode says we could
>> compress with algorithms but it would make every processor along the way
>> crawl needlessly.
> Could someone give a tiny intro to the Problems faced by the
> localization community? GLV/Monthly meet?
I would like to hear what Hariram Atreya has to say on this.
More information about the ilugc