These are some of the sample set of Character sets supported under UTF8 format;
Code:
The Unicode Standard defines codes for characters used
in every major language written today.
It includes scripts:
Latin
Greek
Cyrillic
Armenian
Hebrew
Arabic
Devanagari
Bengali
Gurmukhi
Gujarati
Oriya
Tamil
Telugu
Kannada
Malayalam
Thai
Lao
Georgian
Tibetan
Japanese kana
Complete set of modern Korean hangul
Unified set of Chinese/Japanese/Korean (CJK) ideographs.
The character set name for UTF-8 is
AL24UTFFSS for UNICODE Version 1.1
UTF8 for UNICODE Version 2.0.
Hope this will help
Sam
Thanx
Sam
Life is a journey, not a destination!