[Ilugc] UTF-8 and indic languages

Ramanraj K ramanraj.k at gmail.com
Sun Nov 19 08:18:04 IST 2006


Prashanth Mohan wrote:
> Ramanraj K wrote:
>   
>> Tamil and other indic scripts are 3 bytes per character if encoded
>> with UTF-8. That appears to be expensive.  Unicode says we could
>> compress with algorithms but it would make every processor along the way
>> crawl needlessly.
>>     
>
> Could someone give a tiny intro to the Problems faced by the
> localization community? GLV/Monthly meet?
>   

I would like to hear what Hariram Atreya has to say on this.


More information about the ilugc mailing list