Charecter set Selection
DBAsupport.com Forums - Powered by vBulletin
Results 1 to 4 of 4

Thread: Charecter set Selection

  1. #1
    Join Date
    Jul 2002
    Location
    Washington DC
    Posts
    110

    Charecter set Selection

    Hi,

    We would like to develop product which should be able to operate from diffrent parts of world.I.e. it should support diffrent languages like Japanes,Chinese i.e. multibyte caracter set.I thought of installing the Oracle with UTF8 as national charecter set. Reamrks, description comments etc fields especially shoudl support diffrent languages.If I select UTF8 as the national charecter set will i get any advantage for decalring a datatype in table like VARCAHR2 instead of NVARCAHR2,I mean will it go for multybyte storage for singel charectar

    I went through soem links but I could not reach in a conclusion and regarding performance implication also.

    Any suggetions will be appreciated.

    Manoj

  2. #2
    Join Date
    Oct 2000
    Location
    Saskatoon, SK, Canada
    Posts
    3,925
    Here is a sample test case that will help you to undestand this:

    Code:
    Where 50050 is the LATIN CAPITAL LETTER A WITH CIRCUMFLEX
          14844588  is the EURO SIGN
    
        
    SQL> create table test (testcol varchar2(20));
    SQL> create table test1( testcol nvarchar2(20));
    
    SQL> insert into test values (chr(50050));
    SQL> insert into test values (chr(14844588));
    
    SQL> insert into test1 values (nchr(50050));
    SQL> insert into test1 values (nchr(14844588));
    
    SQL> commit;
    
    SQL> select dump(testcol) from test;
    
    DUMP(TESTCOL)
    --------------------------------------------------------------------------------
    Typ=1 Len=2: 195,130
    Typ=1 Len=3: 226,130,172
    
    SQL> select dump(testcol) from test1;
    
    DUMP(TESTCOL)
    --------------------------------------------------------------------------------
    Typ=1 Len=2: 195,130
    Typ=1 Len=3: 226,130,172
    
    SQL> select dump(testcol,1016) from test; 
    
    DUMP(TESTCOL,1016)
    --------------------------------------------------------------------------------
    Typ=1 Len=2 CharacterSet=UTF8: c3,82
    Typ=1 Len=3 CharacterSet=UTF8: e2,82,ac
    
    SQL>  select dump(testcol,1016) from test1;
    
    DUMP(TESTCOL,1016)
    --------------------------------------------------------------------------------
    Typ=1 Len=2 CharacterSet=UTF8: c3,82
    Typ=1 Len=3 CharacterSet=UTF8: e2,82,ac
    The Length will specify the # of Bytes it takes to store in the database. Now you can make your own judgement call

    -Sam
    Thanx
    Sam



    Life is a journey, not a destination!


  3. #3
    Join Date
    Oct 2003
    Posts
    312
    Could someone please show me where can I find "
    Where 50050 is the LATIN CAPITAL LETTER A WITH CIRCUMFLEX
    14844588 is the EURO SIGN
    "


    2. and why did Sam have the dump in front or the column
    3. Can you use the length(column) for UTF8 database????

  4. #4
    Join Date
    Oct 2003
    Posts
    312
    is there any way we can use the bytes based the below:

    QL> select dump(testcol,1016) from test;

    DUMP(TESTCOL,1016)
    --------------------------------------------------------------------------------
    Typ=1 Len=2 CharacterSet=UTF8: c3,82
    Typ=1 Len=3 CharacterSet=UTF8: e2,82,ac



    can we use the code c3, 82 and insert into the database based on these code????

    I don't have any tools for UNICODE compatible

    any advises???

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  


Click Here to Expand Forum to Full Width