Skip Headers
Oracle® Database Globalization Support Guide
10g Release 2 (10.2)

Part Number B14225-02
Go to Documentation Home
Home
Go to Book List
Book List
Go to Table of Contents
Contents
Go to Index
Index
Go to Master Index
Master Index
Go to Feedback page
Contact Us

Go to previous page
Previous
Go to next page
Next
View PDF

A Locale Data

This appendix lists the languages, territories, character sets, and other locale data supported by the Oracle server. It includes these topics:

You can obtain information about character sets, languages, territories, and linguistic sorts by querying the V$NLS_VALID_VALUES dynamic performance view.

See Also:

Oracle Database Reference for more information about the data that can be returned by this view

Languages

Languages in Table A-1 provide support for locale-sensitive information such as the following:

By using Unicode databases and datatypes, you can store, process, and retrieve data for almost all contemporary languages, including many that do not appear in Table A-1.

Table A-1 Oracle Supported Languages

Language Name Language Abbreviation Default Sort

AMERICAN

us

binary

ARABIC

ar

ARABIC

ASSAMESE

as

binary

AZERBAIJANI

az

AZERBAIJANI

BANGLA

bn

binary

BRAZILIAN PORTUGUESE

ptb

WEST_EUROPEAN

BULGARIAN

bg

BULGARIAN

CANADIAN FRENCH

frc

CANADIAN FRENCH

CATALAN

ca

CATALAN

CROATIAN

hr

CROATIAN

CYRILLIC KAZAKH

ckk

GENERIC_M

CYRILLIC SERBIAN

csr

GENERIC_M

CYRILLIC UZBEK

cuz

GENERIC_M

CZECH

cs

CZECH

DANISH

dk

DANISH

DUTCH

nl

DUTCH

EGYPTIAN

eg

ARABIC

ENGLISH

gb

binary

ESTONIAN

et

ESTONIAN

FINNISH

sf

FINNISH

FRENCH

f

FRENCH

GERMAN DIN

din

GERMAN

GERMAN

d

GERMAN

GREEK

el

GREEK

GUJARATI

gu

binary

HEBREW

iw

HEBREW

HINDI

hi

binary

HUNGARIAN

hu

HUNGARIAN

ICELANDIC

is

ICELANDIC

INDONESIAN

in

INDONESIAN

ITALIAN

i

WEST_EUROPEAN

JAPANESE

ja

binary

KANNADA

kn

binary

KOREAN

ko

binary

LATIN AMERICAN SPANISH

esa

SPANISH

LATIN SERBIAN

lsr

binary

LATIN UZBEK

luz

GENERIC_M

LATVIAN

lv

LATVIAN

LITHUANIAN

lt

LITHUANIAN

MACEDONIAN

mk

binary

MALAY

ms

MALAY

MALAYALAM

ml

binary

MARATHI

mr

binary

MEXICAN SPANISH

esm

WEST_EUROPEAN

NORWEGIAN

n

NORWEGIAN

ORIYA

or

binary

POLISH

pl

POLISH

PORTUGUESE

pt

WEST_EUROPEAN

PUNJABI

pa

binary

ROMANIAN

ro

ROMANIAN

RUSSIAN

ru

RUSSIAN

SIMPLIFIED CHINESE

zhs

binary

SLOVAK

sk

SLOVAK

SLOVENIAN

sl

SLOVENIAN

SPANISH

e

SPANISH

SWEDISH

s

SWEDISH

TAMIL

ta

binary

TELUGU

te

binary

THAI

th

THAI_DICTIONARY

TRADITIONAL CHINESE

zht

binary

TURKISH

tr

TURKISH

UKRAINIAN

uk

UKRAINIAN

VIETNAMESE

vn

VIETNAMESE


Translated Messages

Oracle error messages have been translated into the languages which are listed in Table A-2.

Table A-2 Oracle Supported Messages

Name Abbreviation

ARABIC

ar

BRAZILIAN PORTUGUESE

ptb

CATALAN

ca

CZECH

cs

DANISH

dk

DUTCH

nl

FINNISH

sf

FRENCH

f

GERMAN

d

GREEK

el

HEBREW

iw

HUNGARIAN

hu

ITALIAN

i

JAPANESE

ja

KOREAN

ko

NORWEGIAN

n

POLISH

pl

PORTUGUESE

pt

ROMANIAN

ro

RUSSIAN

ru

SIMPLIFIED CHINESE

zhs

SLOVAK

sk

SPANISH

e

SWEDISH

s

THAI

th

TRADITIONAL CHINESE

zht

TURKISH

tr


Territories

Table A-3 lists the territories supported by the Oracle server.

Table A-3 Oracle Supported Territories

Name Name Name

ALGERIA

GREECE

POLAND

AMERICA

HONG KONG

PORTUGAL

ARGENTINA

HUNGARY

PUERTO RICO

AUSTRALIA

ICELAND

QATAR

AUSTRIA

INDIA

ROMANIA

AZERBAIJAN

INDONESIA

RUSSIA

BAHRAIN

IRAQ

SAUDI ARABIA

BANGLADESH

IRELAND

SERBIA AND MONTENEGRO

BELGIUM

ISRAEL

SINGAPORE

BRAZIL

ITALY

SLOVAKIA

BULGARIA

JAPAN

SLOVENIA

CANADA

JORDAN

SOMALIA

CATALONIA

KAZAKHSTAN

SOUTH AFRICA

CHILE

KOREA

SPAIN

CHINA

KUWAIT

SUDAN

COLOMBIA

LATVIA

SWEDEN

COSTA RICA

LEBANON

SWITZERLAND

CROATIA

LIBYA

SYRIA

CYPRUS

LITHUANIA

TAIWAN

CZECH REPUBLIC

LUXEMBOURG

THAILAND

DENMARK

MALAYSIA

THE NETHERLANDS

DJIBOUTI

MAURITANIA

TUNISIA

ECUADOR

MEXICO

TURKEY

EGYPT

MOROCCO

UKRAINE

EL SALVADOR

NEW ZEALAND

UNITED ARAB EMIRATES

ESTONIA

NICARAGUA

UNITED KINGDOM

FINLAND

NORWAY

UZBEKISTAN

FRANCE

OMAN

VENEZUELA

FYR MACEDONIA

PANAMA

VIETNAM

GUATEMALA

PERU

YEMEN

GERMANY

PHILIPPINES



Character Sets

Oracle-supported character sets are listed in the following sections according to three broad categories.

In addition, common character set subset/superset combinations are listed. Some character sets can only be used with certain data types. For example, the AL16UTF16 character set can only be used as an NCHAR character set, and not as a database character set.

Also documented in the comment section are other unique features of the character set that may be important to users or your database administrator. For example, the information includes whether the character set supports the euro currency symbol, whether user-defined characters are supported, and whether the character set is a strict superset of ASCII. (You can use the CSALTER script to migrate an existing database to a new character set, only if all of the schema data is a strict subset of the new character set.)

The following is the key for the comment column of the character set tables:

SB: single-byte encoding
MB: multibyte encoding
FIXED: fixed-width multibyte encoding
ASCII: strict superset of ASCII
EURO: euro symbol supported
UDC: user-defined characters supported

Oracle does not document individual code page layouts. For specific details about a particular character set, its character repertoire, and code point values, you can use Oracle Locale Builder. Otherwise, you should refer to the actual national, international, or vendor-specific standards.

Recommended Database Character Sets

Table A-4 lists the recommended and most commonly used ASCII-based Oracle database character sets. The list is ordered alphabetically within their respective language group.

Table A-4 Recommended ASCII Database Character Sets


Name Description Comments

Asian





JA16EUC

EUC 24-bit Japanese

MB, ASCII


JA16EUCTILDE

The same as JA16EUC except for the way that the wave dash and the tilde are mapped to and from Unicode.

MB, ASCII


JA16SJIS

Shift-JIS 16-bit Japanese

MB, ASCII, UDC


JA16SJISTILDE

The same as JA16SJIS except for the way that the wave dash and the tilde are mapped to and from Unicode.

MB, ASCII, UDC


KO16MSWIN949

MS Windows Code Page 949 Korean

MB, ASCII, UDC


TH8TISASCII

Thai Industrial Standard 620-2533 - ASCII 8-bit

SB, ASCII, EURO


VN8MSWIN1258

MS Windows Code Page 1258 8-bit Vietnamese

SB, ASCII, EURO


ZHS16GBK

GBK 16-bit Simplified Chinese

MB, ASCII, UDC


ZHT16HKSCS

MS Windows Code Page 950 with Hong Kong Supplementary Character Set HKSCS-2001 (character set conversion to and from Unicode is based on Unicode 3.0)

MB, ASCII, EURO


ZHT16MSWIN950

MS Windows Code Page 950 Traditional Chinese

MB, ASCII, UDC


ZHT32EUC

EUC 32-bit Traditional Chinese

MB, ASCII

European





BLT8ISO8859P13

ISO 8859-13 Baltic

SB, ASCII


BLT8MSWIN1257

MS Windows Code Page 1257 8-bit Baltic

SB, ASCII, EURO


CL8ISO8859P5

ISO 8859-5 Latin/Cyrillic

SB, ASCII


CL8MSWIN1251

MS Windows Code Page 1251 8-bit Latin/Cyrillic

SB, ASCII, EURO


EE8ISO8859P2

ISO 8859-2 East European

SB, ASCII


EL8ISO8859P7

ISO 8859-7 Latin/Greek

SB, ASCII, EURO


EL8MSWIN1253

MS Windows Code Page 1253 8-bit Latin/Greek

SB, ASCII, EURO


EE8MSWIN1250

MS Windows Code Page 1250 8-bit East European

SB, ASCII, EURO


NE8ISO8859P10

ISO 8859-10 North European

SB, ASCII


NEE8ISO8859P4

ISO 8859-4 North and North-East European

SB, ASCII


WE8ISO8859P15

ISO 8859-15 West European

SB, ASCII, EURO


WE8MSWIN1252

MS Windows Code Page 1252 8-bit West European

SB, ASCII, EURO

Middle Eastern





AR8ISO8859P6

ISO 8859-6 Latin/Arabic

SB, ASCII


AR8MSWIN1256

MS Windows Code Page 1256 8-Bit Latin/Arabic

SB, ASCII, EURO


IW8ISO8859P8

ISO 8859-8 Latin/Hebrew

SB, ASCII


IW8MSWIN1255

MS Windows Code Page 1255 8-bit Latin/Hebrew

SB, ASCII, EURO


TR8MSWIN1254

MS Windows Code Page 1254 8-bit Turkish

SB, ASCII, EURO


WE8ISO8859P9

ISO 8859-9 West European & Turkish

SB, ASCII

Universal





AL32UTF8

Unicode 4.0 UTF-8 Universal character set

MB, ASCII, EURO


Table A-5 lists the recommended and most commonly used EBCDIC-based Oracle database character sets. The list is ordered alphabetically within their respective language group.

Table A-5 Recommended EBCDIC Database Character Sets


Name Description Comments

Asian





JA16DBCS

IBM EBCDIC 16-bit Japanese

MB, UDC


JA16EBCDIC930

IBM DBCS Code Page 290 16-bit Japanese

MB, UDC


KO16DBCS

IBM EBCDIC 16-bit Korean

MB, UDC


TH8TISEBCDICS

Thai Industrial Standard 620-2533-EBCDIC Server 8-bit

SB

European





BLT8EBCDIC1112S

EBCDIC Code Page 1112 8-bit Server Baltic Multilingual

SB


CE8BS2000

Siemens EBCDIC.DF.04 8-bit Central European

SB


CL8BS2000

Siemens EBCDIC.EHC.LC 8-bit Cyrillic

SB


CL8EBCDIC1025R

EBCDIC Code Page 1025 Server 8-bit Cyrillic

SB


CL8EBCDIC1158R

EBCDIC Code Page 1158 Server 8-bit Cyrillic

SB


D8EBCDIC1141

EBCDIC Code Page 1141 8-bit Austrian German

SB, EURO


DK8DBCDIC1142

EBCDIC Code Page 1142 8-bit Danish

SB, EURO


EE8BS2000

Siemens EBCDIC.DF.04 8-bit East European

SB


EE8EBCDIC870S

EBCDIC Code Page 870 Server 8-bit East European

SB


EL8EBCDIC423R

IBM EBCDIC Code Page 423 for RDBMS server-side

SB


EL8EBCDIC875R

EBCDIC Code Page 875 Server 8-bit Greek

SB


F8EBCDIC1147

EBCDIC Code Page 1147 8-bit French

SB, EURO


I8EBCDIC1144

EBCDIC Code Page 1144 8-bit Italian

SB, EURO


S8EBCDCI1143

EBCDIC Code Page 1143 8-bit Swedish

SB, EURO


WE8BS2000

Siemens EBCDIC.DF.04 8-bit West European

SB


WE8BS2000E

Siemens EBCDIC.DF.04 8-bit West European

SB, EURO


WE8BS2000L5

Siemens EBCDIC.DF.L5 8-bit West European/Turkish

SB


WE8EBCDIC1047E

Latin 1/Open Systems 1047

SB, EBCDIC, EURO


WE8EBCDIC1140

EBCDIC Code Page 1140 8-bit West European

SB, EURO


WE8EBCDIC1145

EBCDIC Code Page 1145 8-bit West European

SB, EURO


WE8DBCDIC1146

EBCDIC Code Page 1146 8-bit West European

SB, EURO


WE8EBCDIC1148

EBCDIC Code Page 1148 8-bit West European

SB, EURO

Middle Eastern





AR8EBCDIC420S

EBCDIC Code Page 420 Server 8-bit Latin/Arabic

SB


IW8EBCDIC424S

EBCDIC Code Page 424 Server 8-bit Latin/Hebrew

SB


TR8EBCDIC1026S

EBCDIC Code Page 1026 Server 8-bit Turkish

SB


Other Character Sets

Table A-6 lists the other ASCII-based Oracle character sets. The list is ordered alphabetically within their language groups.

Table A-6 Other ASCII Character Sets


Name Description Comments

Asian





BN8BSCII

Bangladesh National Code 8-bit BSCII

SB, ASCII


IN8ISCII

Multiple-Script Indian Standard 8-bit Latin/Indian Languages

SB, ASCII


JA16VMS

JVMS 16-bit Japanese

MB, ASCII


KO16KSC5601

KSC5601 16-bit Korean

MB, ASCII


KO16KSCCS

KSCCS 16-bit Korean

MB, ASCII


TH8MACTHAIS

Mac Server 8-bit Latin/Thai

SB, ASCII


VN8VN3

VN3 8-bit Vietnamese

SB, ASCII


ZHS16CGB231280

CGB2312-80 16-bit Simplified Chinese

MB, ASCII


ZHT16BIG5

BIG5 16-bit Traditional Chinese

MB, ASCII


ZHT16CCDC

HP CCDC 16-bit Traditional Chinese

MB, ASCII


ZHT16DBT

Taiwan Taxation 16-bit Traditional Chinese

MB, ASCII


ZHT16HKSCS31

MS Windows Code Page 950 with Hong Kong Supplementary Character Set HKSCS-2001 (character set conversion to and from Unicode is based on Unicode 3.1)

MB, ASCII, EURO


ZHT32SOPS

SOPS 32-bit Traditional Chinese

MB, ASCII


ZHT32TRIS

TRIS 32-bit Traditional Chinese

MB, ASCII

Middle Eastern





AR8ADOS710

Arabic MS-DOS 710 Server 8-bit Latin/Arabic

SB, ASCII


AR8ADOS710T

Arabic MS-DOS 710 8-bit Latin/Arabic

SB


AR8ADOS720

Arabic MS-DOS 720 Server 8-bit Latin/Arabic

SB, ASCII


AR8ADOS720T

Arabic MS-DOS 720 8-bit Latin/Arabic

SB


AR8APTEC715

APTEC 715 Server 8-bit Latin/Arabic

SB, ASCII


AR8APTEC715T

APTEC 715 8-bit Latin/Arabic

SB


AR8ASMO708PLUS

ASMO 708 Plus 8-bit Latin/Arabic

SB, ASCII


AR8ASMO8X

ASMO Extended 708 8-bit Latin/Arabic

SB, ASCII


AR8HPARABIC8T

HP 8-bit Latin/Arabic

SB


AR8ISO8859P6

ISO 8859-6 Latin/Arabic

SB, ASCII


AR8MUSSAD768

Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic

SB, ASCII


AR8MUSSAD768T

Mussa'd Alarabi/2 768 8-bit Latin/Arabic

SB


AR8NAFITHA711

Nafitha Enhanced 711 Server 8-bit Latin/Arabic

SB, ASCII


AR8NAFITHA711T

Nafitha Enhanced 711 8-bit Latin/Arabic

SB


AR8NAFITHA721

Nafitha International 721 Server 8-bit Latin/Arabic

SB, ASCII


AR8NAFITHA721T

Nafitha International 721 8-bit Latin/Arabic

SB


AR8SAKHR706

SAKHR 706 Server 8-bit Latin/Arabic

SB, ASCII


AR8SAKHR707

SAKHR 707 Server 8-bit Latin/Arabic

SB, ASCII


AR8SAKHR707T

SAKHR 707 8-bit Latin/Arabic

SB


AR8XBASIC

XBASIC 8-bit Latin/Arabic

SB


AZ8ISO8859PE

ISO 8859-9 Latin Azerbaijani

SB, ASCII


IN8ISCII

Multiple-Script Indian Standard 8-bit Latin/Indian Languages

SB, ASCII


IW8MACHEBREW

Mac Client 8-bit Hebrew

SB


IW8PC1507

IBM-PC Code Page 1507/862 8-bit Latin/Hebrew

SB, ASCII


LA8ISO6937

ISO 6937 8-bit Coded Character Set for Text Communication

SB, ASCII


TR7DEC

DEC VT100 7-bit Turkish

SB


TR8DEC

DEC 8-bit Turkish

SB, ASCII


TR8PC857

IBM-PC Code Page 857 8-bit Turkish

SB, ASCII

European





AR8ARABICMAC

Mac Client 8-bit Latin/Arabic

SB


AR8ARABICMACS

Mac Server 8-bit Latin/Arabic

SB, ASCII


BG8MSWIN

MS Windows 8-bit Bulgarian Cyrillic

SB, ASCII


BG8PC437S

IBM-PC Code Page 437 8-bit (Bulgarian Modification)

SB, ASCII


BLT8CP921

Latvian Standard LVS8-92(1) Windows/Unix 8-bit Baltic

SB, ASCII


BLT8PC775

IBM-PC Code Page 775 8-bit Baltic

SB, ASCII


CDN8PC863

IBM-PC Code Page 863 8-bit Canadian French

SB, ASCII


CEL8ISO8859P14

ISO 8859-13 Celtic

SB, ASCII


CL8ISOIR111

ISOIR111 Cyrillic

SB


CL8KOI8R

RELCOM Internet Standard 8-bit Latin/Cyrillic

SB, ASCII


CL8KOI8U

KOI8 Ukrainian Cyrillic

SB


CL8MACCYRILLICS

Mac Server 8-bit Latin/Cyrillic

SB, ASCII


EE8MACCES

Mac Server 8-bit Central European

SB, ASCII


EE8MACCROATIANS

Mac Server 8-bit Croatian

SB, ASCII


EE8PC852

IBM-PC Code Page 852 8-bit East European

SB, ASCII


EL8DEC

DEC 8-bit Latin/Greek

SB


EL8MACGREEKS

Mac Server 8-bit Greek

SB, ASCII


EL8PC437S

IBM-PC Code Page 437 8-bit (Greek modification)

SB, ASCII


EL8PC851

IBM-PC Code Page 851 8-bit Greek/Latin

SB, ASCII


EL8PC869

IBM-PC Code Page 869 8-bit Greek/Latin

SB, ASCII


ET8MSWIN923

MS Windows Code Page 923 8-bit Estonian

SB, ASCII


HU8ABMOD

Hungarian 8-bit Special AB Mod

SB, ASCII


HU8CWI2

Hungarian 8-bit CWI-2

SB, ASCII


IS8PC861

IBM-PC Code Page 861 8-bit Icelandic

SB, ASCII


IW7IS960

Israeli Standard 960 7-bit Latin/Hebrew

SB


IW8ISO8859P8

ISO 8859-8 Latin/Hebrew

SB, ASCII


LA8ISO6937

ISO 6937 8-bit Coded Character Set for Text Communication

SB, ASCII


LA8PASSPORT

German Government Printer 8-bit All-European Latin

SB, ASCII


LT8MSWIN921

MS Windows Code Page 921 8-bit Lithuanian

SB, ASCII


LT8PC772

IBM-PC Code Page 772 8-bit Lithuanian (Latin/Cyrillic)

SB, ASCII


LT8PC774

IBM-PC Code Page 774 8-bit Lithuanian (Latin)

SB, ASCII


LV8PC8LR

Latvian Version IBM-PC Code Page 866 8-bit Latin/Cyrillic

SB, ASCII


LV8PC1117

IBM-PC Code Page 1117 8-bit Latvian

SB, ASCII


LV8RST104090

IBM-PC Alternative Code Page 8-bit Latvian (Latin/Cyrillic)

SB, ASCII


N8PC865

IBM-PC Code Page 865 8-bit Norwegian

SB, ASCII


RU8BESTA

BESTA 8-bit Latin/Cyrillic

SB, ASCII


RU8PC855

IBM-PC Code Page 855 8-bit Latin/Cyrillic

SB, ASCII


RU8PC866

IBM-PC Code Page 866 8-bit Latin/Cyrillic

SB, ASCII


SE8ISO8859P3

ISO 8859-3 South European

SB, ASCII


TR8MACTURKISH

Mac Client 8-bit Turkish

SB


TR8MACTURKISHS

Mac Server 8-bit Turkish

SB, ASCII


TR8PC857

IBM-PC Code Page 857 8-bit Turkish

SB, ASCII


US7ASCII

ASCII 7-bit American

SB, ASCII


US8PC437

IBM-PC Code Page 437 8-bit American

SB, ASCII


WE8DEC

DEC 8-bit West European

SB, ASCII


WE8DG

DG 8-bit West European

SB, ASCII


WE8ISO8859P1

ISO 8859-1 West European

SB, ASCII


WE8MACROMAN8S

Mac Server 8-bit Extended Roman8 West European

SB, ASCII


WE8NCR4970

NCR 4970 8-bit West European

SB, ASCII


WE8NEXTSTEP

NeXTSTEP PostScript 8-bit West European

SB, ASCII


WE8PC850

IBM-PC Code Page 850 8-bit West European

SB, ASCII


WE8PC858

IBM-PC Code Page 858 8-bit West European

SB, ASCII, EURO


WE8PC860

IBM-PC Code Page 860 8-bit West European

SB, ASCII


WE8ROMAN8

HP Roman8 8-bit West European

SB, ASCII

Universal





UTF8

Unicode 3.0 UTF-8 Universal character set, CESU-8 compliant

MB, ASCII, EURO


Table A-7 lists the other EBCDIC-based Oracle character sets. The list is ordered alphabetically within their language groups.

Table A-7 Other EBCDIC Character Sets


Name Description Comments

Asian





TH8TISEBCDIC

Thai Industrial Standard 620-2533 - EBCDIC 8-bit

SB


ZHS16DBCS

IBM EBCDIC 16-bit Simplified Chinese

MB, UDC


ZHT16DBCS

IBM EBCDIC 16-bit Traditional Chinese

MB, UDC

Middle Eastern





AR8EBCDICX

EBCDIC XBASIC Server 8-bit Latin/Arabic

SB


IW8EBCDIC424

EBCDIC Code Page 424 8-bit Latin/Hebrew

SB


IW8EBCDIC1086

EBCDIC Code Page 1086 8-bit Hebrew

SB


TR8EBCDIC1026

EBCDIC Code Page 1026 8-bit Turkish

SB


WE8EBCDIC37C

EBCDIC Code Page 37 8-bit Oracle/c

SB

European





BLT8EBCDIC1112

EBCDIC Code Page 1112 8-bit Server Baltic Multilingual

SB


CL8EBCDIC1025

EBCDIC Code Page 1025 8-bit Cyrillic

SB


CL8EBCDIC1025C

EBCDIC Code Page 1025 Client 8-bit Cyrillic

SB


CL8EBCDIC1025S

EBCDIC Code Page 1025 Server 8-bit Cyrillic

SB


CL8EBCDIC1025X

EBCDIC Code Page 1025 (Modified) 8-bit Cyrillic

SB


CL8EBCDIC1158

EBCDIC Code Page 1158 8-bit Cyrillic

SB


D8BS2000

Siemens 9750-62 EBCDIC 8-bit German

SB


D8EBCDIC273

EBCDIC Code Page 273/1 8-bit Austrian German

SB


DK7SIEMENS9780X

Siemens 97801/97808 7-bit Danish

SB


DK8BS2000

Siemens 9750-62 EBCDIC 8-bit Danish

SB


DK8EBCDIC277

EBCDIC Code Page 277/1 8-bit Danish

SB


E8BS2000

Siemens 9750-62 EBCDIC 8-bit Spanish

SB


EE8EBCDIC870

EBCDIC Code Page 870 8-bit East European

SB


EE8EBCDIC870C

EBCDIC Code Page 870 Client 8-bit East European

SB


EL8EBCDIC875

EBCDIC Code Page 875 8-bit Greek

SB


EL8GCOS7

Bull EBCDIC GCOS7 8-bit Greek

SB


F8BS2000

Siemens 9750-62 EBCDIC 8-bit French

SB


F8EBCDIC297

EBCDIC Code Page 297 8-bit French

SB


I8EBCDIC280

EBCDIC Code Page 280/1 8-bit Italian

SB


S8BS2000

Siemens 9750-62 EBCDIC 8-bit Swedish

SB


S8EBCDIC278

EBCDIC Code Page 278/1 8-bit Swedish

SB


US8ICL

ICL EBCDIC 8-bit American

SB


US8BS2000

Siemens 9750-62 EBCDIC 8-bit American

SB


WE8EBCDIC924

Latin 9 EBCDIC 924

SB, EBCDIC


WE8EBCDIC37

EBCDIC Code Page 37 8-bit West European

SB


WE8EBCDIC284

EBCDIC Code Page 284 8-bit Latin American/Spanish

SB


WE8EBCDIC285

EBCDIC Code Page 285 8-bit West European

SB


WE8EBCDIC1047

EBCDIC Code Page 1047 8-bit West European

SB


WE8EBCDIC1140C

EBCDIC Code Page 1140 8-bit West European

SB, EURO


WE8EBCDIC1148C

EBCDIC Code Page 1148 Client 8-bit West European

SB, EURO


WE8EBCDIC500C

EBCDIC Code Page 500 8-bit Oracle/c

SB


WE8EBCDIC500

EBCDIC Code Page 500 8-bit West European

SB


WE8EBCDIC871

EBCDIC Code Page 871 8-bit Icelandic

SB


WE8ICL

ICL EBCDIC 8-bit West European

SB


WE8GCOS7

Bull EBCDIC GCOS7 8-bit West European

SB

Universal





UTFE

EBCDIC form of Unicode 3.0 UTF-8 Universal character set (UTF-EBCDIC)

MB, EURO


Character Sets that Support the Euro Symbol

Table A-8 lists the character sets that support the Euro symbol.

Table A-8 Character Sets that Support the Euro Symbol

Character Set Name Hexadecimal Code Value of the Euro Symbol

AL16UTF16

20AC

AL32UTF8

E282AC

AR8MSWIN1256

80

BLT8MSWIN1257

80

CL8EBCDIC1158

E1

CL8EBCDIC1158R

9F

CL8MSWIN1251

88

D8EBCDIC1141

9F

DK8EBCDIC1142

5A

EE8MSWIN1250

80

EL8EBCDIC423R

FD

EL8EBCDIC875R

DF

EL8ISO8859P7

A4

EL8MSWIN1253

80

F8EBCDIC1147

9F

I8EBCDIC1144

9F

IW8MSWIN1255

80

KO16KSC5601

A2E6

KO16KSCCS

D9E6

KO16MSWIN949

A2E6

S8EBCDIC1143

5A

TH8TISASCII

80

TR8MSWIN1254

80

UTF8

E282AC

UTFE

CA4653

VN8MSWIN1258

80

WE8BS2000E

9F

WE8EBCDIC1047E

9F

WE8EBCDIC1140

9F

WE8EBCDIC1140C

9F

WE8EBCDIC1145

9F

WE8EBCDIC1146

9F

WE8EBCDIC1148

9F

WE8EBCDIC1148C

9F

WE8EBCDIC924

9F

WE8ISO8859P15

A4

WE8MACROMAN8

DB

WE8MACROMAN8S

DB

WE8MSWIN1252

80

WE8PC858

DF

ZHS32GB18030

A2E3

ZHT16HKSCS

A3E1

ZHT16HKSCS31

A3E1

ZHT16MSWIN950

A3E1


Client-Only Character Sets

Table A-9 lists the Oracle character sets that are supported as client-only character sets. The list is ordered alphabetically within their respective language groups.

Table A-9 Client-Only Character Sets


Name Description Comments

Asian





JA16EUCYEN

EUC 24-bit Japanese with '\' mapped to the Japanese yen character

MB


JA16MACSJIS

Mac client Shift-JIS 16-bit Japanese

MB


JA16SJISYEN

Shift-JIS 16-bit Japanese with '\' mapped to the Japanese yen character

MB, UDC


TH8MACTHAI

Mac Client 8-bit Latin/Thai

SB


ZHS32GB18030

GB18030-2000

MB, ASCII, EURO


ZHS16MACCGB231280

Mac client CGB2312-80 16-bit Simplified Chinese

MB

European





CH7DEC

DEC VT100 7-bit Swiss (German/French)

SB


CL8MACCYRILLIC

Mac Client 8-bit Latin/Cyrillic

SB


D7SIEMENS9780X

Siemens 97801/97808 7-bit German

SB


D7DEC

DEC VT100 7-bit German

SB


EEC8EUROASCI

EEC Targon 35 ASCI West European/Greek

SB


EEC8EUROPA3

EEC EUROPA3 8-bit West European/Greek

SB


EE8MACCROATIAN

Mac Client 8-bit Croatian

SB


EE8MACCE

Mac Client 8-bit Central European

SB


EL8PC737

IBM-PC Code Page 737 8-bit Greek/Latin

SB


EL8MACGREEK

Mac Client 8-bit Greek

SB


E7DEC

DEC VT100 7-bit Spanish

SB


E7SIEMENS9780X

Siemens 97801/97808 7-bit Spanish

SB


F7DEC

DEC VT100 7-bit French

SB


F7SIEMENS9780X

Siemens 97801/97808 7-bit French

SB


I7DEC

DEC VT100 7-bit Italian

SB


I7SIEMENS9780X

Siemens 97801/97808 7-bit Italian

SB


IS8MACICELANDICS

Mac Server 8-bit Icelandic

SB


IS8MACICELANDIC

Mac Client 8-bit Icelandic

SB


NL7DEC

DEC VT100 7-bit Dutch

SB


NDK7DEC

DEC VT100 7-bit Norwegian/Danish

SB


N7SIEMENS9780X

Siemens 97801/97808 7-bit Norwegian

SB


SF7DEC

DEC VT100 7-bit Finnish

SB


S7SIEMENS9780X

Siemens 97801/97808 7-bit Swedish

SB


S7DEC

DEC VT100 7-bit Swedish

SB


SF7ASCII

ASCII 7-bit Finnish

SB


TR7DEC

DEC VT100 7-bit Turkish

SB


WE8ISOICLUK

ICL special version ISO8859-1

SB


WE8MACROMAN8

Mac Client 8-bit Extended Roman8 West European

SB


WE8HP

HP LaserJet 8-bit West European

SB


YUG7ASCII

ASCII 7-bit Yugoslavian

SB

Middle Eastern





AR8ARABICMAC

Mac Client 8-bit Latin/Arabic

SB


AR8ARABICMACT

Mac 8-bit Latin/Arabic

SB


AR8MUSSAD768

Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic

SB, ASCII


IW7IS960

Israeli Standard 960 7-bit Latin/Hebrew

SB


IW8MACHEBREW

Mac Client 8-bit Hebrew

SB


TR8MACTURKISH

Mac Client 8-bit Turkish

SB


Universal Character Sets

Table A-10 lists the Oracle character sets that provide universal language support. They attempt to support all languages of the world, including, but not limited to, Asian, European, and Middle Eastern languages.

Table A-10 Universal Character Sets

Name Description Comments

AL16UTF16

Unicode 4.0 UTF-16 Universal character set

MB, EURO, FIXED

AL32UTF8

Unicode 4.0 UTF-8 Universal character set

MB, ASCII, EURO

UTF8

Unicode 3.0 UTF-8 Universal character set, CESU-8 compliant

MB, ASCII, EURO

UTFE

EBCDIC form of Unicode 3.0 UTF-8 Universal character set (UTF-EBCDIC)

MB, EURO


Note:

CESU-8 defines an encoding scheme for Unicode that is identical to UTF-8 except for its representation of supplementary characters. In CESU-8, supplementary characters are represented as six-byte sequences that result from the transformation of each UTF-16 surrogate code unit into an eight-bit form that is similar to the UTF-8 transformation, but without first converting the input surrogate pairs to a scalar value. See Unicode Technical Report #26.

Character Set Conversion Support

The following character set encodings are supported for conversion only. They cannot be used as the database or national character set:

AL16UTF16LE
ISO2022-CN
ISO2022-JP
ISO2022-KR
HZ-GB-2312

You can use these character sets as the source_char_set or dest_char_set in the CONVERT function.

See Oracle Database SQL Reference for more information about the CONVERT function and "The CONVERT Function".

Subsets and Supersets

Table A-11 lists common subset/superset relationships.

Table A-11 Subset-Superset Pairs

Subset Superset

AR8ADOS710

AR8ADOS710T

AR8ADOS720

AR8ADOS720T

AR8ADOS720T

AR8ADOS720

AR8APTEC715

AR8APTEC715T

AR8ARABICMACT

AR8ARABICMAC

AR8ISO8859P6

AR8ASMO708PLUS

AR8ISO8859P6

AR8ASMO8X

AR8MUSSAD768

AR8MUSSAD768T

AR8MUSSAD768T

AR8MUSSAD768

AR8NAFITHA711

AR8NAFITHA711T

AR8NAFITHA721

AR8NAFITHA721T

AR8SAKHR707

AR8SAKHR707T

AR8SAKHR707T

AR8SAKHR707

BLT8CP921

BLT8ISO8859P13

BLT8CP921

LT8MSWIN921

D7DEC

D7SIEMENS9780X

D7SIEMENS9780X

D7DEC

DK7SIEMENS9780X

N7SIEMENS9780X

I7DEC

I7SIEMENS9780X

I7SIEMENS9780X

IW8EBCDIC424

IW8EBCDIC424

IW8EBCDIC1086

KO16KSC5601

KO16MSWIN949

LT8MSWIN921

BLT8ISO8859P13

LT8MSWIN921

BLT8CP921

N7SIEMENS9780X

DK7SIEMENS9780X

US7ASCII

See Table A-12, "US7ASCII Supersets".

UTF8

AL32UTF8

WE8DEC

TR8DEC

WE8DEC

WE8NCR4970

WE8ISO8859P1

WE8MSWIN1252

WE8ISO8859P9

TR8MSWIN1254

WE8NCR4970

TR8DEC

WE8NCR4970

WE8DEC

WE8PC850

WE8PC858


US7ASCII is a special case because so many other character sets are supersets of it. Table A-12 lists supersets for US7ASCII.

Table A-12 US7ASCII Supersets

Supersets Supersets Supersets

AL32UTF8

EE8ISO8859P2

RU8BESTA

AR8ADOS710

EE8MACCES

RU8PC855

AR8ADOS710T

EE8MACCROATIANS

RU8PC866

AR8ADOS720

EE8MSWIN1250

SE8ISO8859P3

AR8ADOS720T

EE8PC852

TH8MACTHAIS

AR8APTEC715

EL8DEC

TH8TISASCII

AR8APTEC715T

EL8ISO8859P7

TR8DEC

AR8ARABICMACS

EL8MACGREEKS

TR8MACTURKISHS

AR8ASMO708PLUS

EL8MSWIN1253

TR8MSWIN1254

AR8ASMO8X

EL8PC437S

TR8PC857

AR8HPARABIC8T

EL8PC851

US8PC437

AR8ISO8859P6

EL8PC869

UTF8

AR8MSWIN1256

ET8MSWIN923

VN8MSWIN1258

AR8MUSSAD768

HU8ABMOD

VN8VN3

AR8MUSSAD768T

HU8CWI2

WE8DEC

AR8NAFITHA711

IN8ISCII

WE8DG

AR8NAFITHA711T

IS8PC861

WE8ISO8859P1

AR8NAFITHA721

IW8ISO8859P8

WE8ISO8859P15

AR8NAFITHA721T

IW8MACHEBREWS

WE8ISO8859P9

AR8SAKHR706

IW8MSWIN1255

WE8MACROMAN8S

AR8SAKHR707

IW8PC1507

WE8MSWIN1252

AR8SAKHR707T

JA16EUC

WE8NCR4970

AZ8ISO8859PE

JA16SJIS

WE8NEXTSTEP

BG8MSWIN

JA16VMS

WE8PC850

BG8PC437S

KO16KSC5601

WE8PC858

BLT8CP921

KO16KSCCS

WE8PC860

BLT8ISO8859P13

KO16MSWIN949

WE8ROMAN8

BLT8MSWIN1257

LA8ISO6937

ZHS16CGB231280

BLT8PC775

LA8PASSPORT

ZHS16GBK

BN8BSCII

LT8MSWIN921

ZHT16BIG5

CDN8PC863

LT8PC772

ZHT16CCDC

CEL8ISO8859P14

LT8PC774

ZHT16DBT

CL8ISO8859P5

LV8PC1117

ZHT16HKSCS

CL8KOI8R

LV8PC8LR

ZHT16MSWIN950

CL8KOI8U

LV8RST104090

ZHT32EUC

CL8ISOIR111

N8PC865

ZHT32SOPS

CL8MACCYRILLICS

NE8ISO8859P10

ZHT32TRIS

CL8MSWIN1251

NEE8ISO8859P4

ZHS32GB18030


Language and Character Set Detection Support

Table A-13 displays the languages and character sets that are supported by the language and character set detection in the Character Set Scanner utilities (CSSCAN and LCSSCAN) and the Globalization Development Kit (GDK).

Each language has several character sets that can be detected.

When the binary values for a language match two or more encodings that have a subset/superset relationship, the subset character set is returned. For example, if the language is German and all characters are 7-bit, then US7ASCII is returned instead of WE8MSWIN1252, WE8ISO8859P15, or WE8ISO8859P1.

When the character set is determined to be UTF-8, the Oracle character set UTF8 is returned by default unless 4-byte characters (supplementary characters) are detected within the text. If 4-byte characters are detected, then the character set is reported as AL32UTF8.

Table A-13 Languages and Character Sets Supported by CSSCAN, LCSSCAN, and GDK

Language Character Sets

Arabic

AL16UTF16, AL32UTF8, AR8ISO8859P6, AR8MSWIN1256, UTF8

Bulgarian

AL16UTF16, AL32UTF8, CL8ISO8859P5, CL8MSWIN1251, UTF8

Catalan

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Croatian

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Czech

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Danish

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Dutch

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

English

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Estonian

AL16UTF16, AL32UTF8, NEE8IOS8859P4, UTF8

Finnish

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

French

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

German

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Greek

AL16UTF16, AL32UTF8, EL8ISO8859P7, EL8MSWIN1253, UTF8

Hebrew

AL16UTF16, AL32UTF8, IW8ISO8859P8, IW8MSWIN1255, UTF8

Hungarian

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Italian

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Japanese

AL16UTF16, AL32UTF8, ISO2022-JP, JA16EUC, JA16SJIS, UTF8

Korean

AL16UTF16, AL32UTF8, ISO2022-KR, KO16KSC5601, KO16MSWIN949, UTF8

Malay

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Norwegian

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Polish

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Portuguese

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Romanian

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Russian

AL16UTF16, AL32UTF8, CL8ISO8859P5, CL8KOI8R, CL8MSWIN1251, UTF8

Simplified Chinese

AL16UTF16, AL32UTF8, HZ-GB-2312, UTF8, ZHS16GBK, ZHS16CGB231280

Slovak

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Spanish

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Swedish

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Thai

AL16UTF16, AL32UTF8, TH8TISASCII, UTF8

Traditional Chinese

AL16UTF16, AL32UTF8, UTF8, ZHT16MSWIN950

Turkish

AL16UTF16, AL32UTF8, TR8MSWIN1254, UTF8, WE8ISO8859P9


Linguistic Sorts

Oracle offers two kinds of linguistic sorts, monolingual and multilingual. In addition, monolingual sorts can be extended to handle special cases. These special cases (represented with a prefix X) typically mean that the characters are sorted differently from their ASCII values. For example, ch and ll are treated as a single character in XSPANISH.

All of the linguistic sorts can be also be performed as case-insensitive or accent-insensitive by appending _CI or _AI to the linguistic sort name.

Table A-14 lists the monolingual linguistic sorts supported by the Oracle server.

See Also:

Table A-1, "Oracle Supported Languages" for a list of the default sort for each language

Table A-14 Monolingual Linguistic Sorts

Basic Name Extended Name Special Cases

ARABIC

-

-

ARABIC_MATCH

-

-

ARABIC_ABJ_SORT

-

-

ARABIC_ABJ_MATCH

-

-

ASCII7

-

-

AZERBAIJANI

XAZERBAIJANI

i, I, lowercase i without dot, uppercase I with dot

BENGALI

-

-

BIG5

-

-

BINARY

-

-

BULGARIAN

-

-

CATALAN

XCATALAN

æ, AE, ß

CROATIAN

XCROATIAN

D, L, N, d, l, n, ß

CZECH

XCZECH

ch, CH, Ch, ß

CZECH_PUNCTUATION

XCZECH_PUNCTUATION

ch, CH, Ch, ß

DANISH

XDANISH

A, ß, Å, å

DUTCH

XDUTCH

ij, IJ

EBCDIC

-

-

EEC_EURO

-

-

EEC_EUROPA3

-

-

ESTONIAN

-

-

FINNISH

-

-

FRENCH

XFRENCH

-

GERMAN

XGERMAN

ß


GERMAN_DIN

XGERMAN_DIN

ß, ä, ö, ü, Ä, Ö, Ü

GBK

-

-

GREEK

-

-

HEBREW

-

-

HKSCS

-

-

HUNGARIAN

XHUNGARIAN

cs, gy, ny, sz, ty, zs, ß, CS, Cs, GY, Gy, NY, Ny, SZ, Sz, TY, Ty, ZS, Zs

ICELANDIC

-

-

INDONESIAN

-

-

ITALIAN

-

-

LATIN

-

-

LATVIAN

-

-

LITHUANIAN

-

-

MALAY

-

-

NORWEGIAN

-

-

POLISH

-

-

PUNCTUATION

XPUNCTUATION

-

ROMANIAN

-

-

RUSSIAN

-

-

SLOVAK

XSLOVAK

dz, DZ, Dz, ß (caron)

SLOVENIAN

XSLOVENIAN

ß


SPANISH

XSPANISH

ch, ll, CH, Ch, LL, Ll

SWEDISH

-

-

SWISS

XSWISS

ß


TURKISH

XTURKISH

æ, AE, ß

UKRAINIAN

-

-

UNICODE_BINARY

-

-

VIETNAMESE

-

-

WEST_EUROPEAN

XWEST_EUROPEAN

ß



Table A-15 lists the multilingual linguistic sorts available in Oracle. All of them include GENERIC_M (an ISO standard for sorting Latin-based characters) as a base. Multilingual linguistic sorts are used for a specific primary language together with Latin-based characters. For example, KOREAN_M sorts Korean and Latin-based characters, but it does not collate Chinese, Thai, or Japanese characters.

Table A-15 Multilingual LInguistic Sorts

Sort Name Description

CANADIAN_M

Canadian French sort supports reverse secondary, special expanding characters

DANISH_M

Danish sort supports sorting uppercase characters before lowercase characters

FRENCH_M

French sort supports reverse sort for secondary

GENERIC_M

Generic sorting order which is based on ISO14651 and Unicode canonical equivalence rules but excluding compatible equivalence rules

JAPANESE_M

Japanese sort supports SJIS character set order and EUC characters which are not included in SJIS

KOREAN_M

Korean sort: Hangul characters are based on Unicode binary order. Hanja characters based on pronunciation order. All Hangul characters are before Hanja characters

SPANISH_M

Traditional Spanish sort supports special contracting characters

THAI_M

Thai sort supports swap characters for some vowels and consonants

SCHINESE_RADICAL_M

Simplified Chinese sort based on radical as primary order and number of strokes order as secondary order

SCHINESE_STROKE_M

Simplified Chinese sort uses number of strokes as primary order and radical as secondary order

SCHINESE_PINYIN_M

Simplified Chinese PinYin sorting order

TCHINESE_RADICAL_M

Traditional Chinese sort based on radical as primary order and number of strokes order as secondary order

TCHINESE_STROKE_M

Traditional Chinese sort uses number of strokes as primary order and radical as secondary order. It supports supplementary characters.


Calendar Systems

By default, most territory definitions use the Gregorian calendar system. Table A-14 lists the other calendar systems supported by the Oracle server.

Table A-16 Supported Calendar Systems

Name Default Date Format Character Set Used For Default Date Format

Japanese Imperial

EEYYMMDD

JA16EUC

ROC Official

EEyymmdd

ZHT32EUC

Thai Buddha

dd month EE yyyy

TH8TISASCII

Persian

DD Month YYYY

AR8ASMO8X

Arabic Hijrah

DD Month YYYY

AR8ISO8859P6

English Hijrah

DD Month YYYY

AR8ISO8859P6


Figure A-1 shows how March 27, 1998 appears in Japanese Imperial.

Figure A-1 Japanese Imperial Example

Description of Figure A-1 follows
Description of "Figure A-1 Japanese Imperial Example"

Time Zone Names

Table A-17 shows the time zone names in the default time zone file that is supplied with the Oracle Database. The default time zone file is $ORACLE_HOME/oracore/zoneinfo/timezlrg.dat. Oracle also supplies a smaller time zone file, $ORACLE_HOME/oracore/zoneinfo/timezone/dat. See Chapter 4, "Datetime Datatypes and Time Zone Support" for more information regarding time zone files.

Table A-17 Time Zone Names

Time Zone Name Is It in the Smaller Time Zone File? Time Zone Name Is It in the Smaller Time Zone File?

Africa/Algiers

No

Australia/Perth

Yes

Africa/Cairo

Yes

Australia/Queensland

Yes

Africa/Casablanca

No

Australia/South

Yes

Africa/Ceuta

No

Australia/Sydney

Yes

Africa/Djibouti

No

Australia/Tasmania

Yes

Africa/Freetown

No

Australia/Victoria

Yes

Africa/Johannesburg

No

Australia/West

Yes

Africa/Khartoum

No

Australia/Yancowinna

Yes

Africa/Mogadishu

No

Brazil/Acre

Yes

Africa/Nairobi

No

Brazil/DeNoronha

Yes

Africa/Nouakchott

No

Brazil/East

Yes

Africa/Tripoli

Yes

Brazil/West

Yes

Africa/Tunis

No

CET

Yes

Africa/Windhoek

No

CST

Yes

America/Adak

Yes

CST6CDT

Yes

America/Anchorage

Yes

Canada/Atlantic

Yes

America/Anguilla

No

Canada/Central

Yes

America/Araguaina

No

Canada/East-Saskatchewan

Yes

America/Aruba

No

Canada/Eastern

Yes

America/Asuncion

No

Canada/Mountain

Yes

America/Atka

Yes

Canada/Newfoundland

Yes

America/Belem

No

Canada/Pacific

Yes

America/Boa_Vista

No

Canada/Saskatchewan

Yes

America/Bogota

No

Canada/Yukon

Yes

America/Boise

No

Chile/Continental

Yes

America/Buenos_Aires

No

Chile/EasterIsland

Yes

America/Cambridge_Bay

No

Cuba

Yes

America/Cancun

No

EET

Yes

America/Caracas

No

EST

Yes

America/Cayenne

No

EST5EDT

Yes

America/Cayman

No

Egypt

Yes

America/Chicago

Yes

Eire

Yes

America/Chihuahua

No

Etc/GMT

Yes

America/Costa_Rica

No

Etc/GMT+0

Yes

America/Cuiaba

No

Etc/GMT+1

Yes

America/Curacao

No

Etc/GMT+10

Yes

America/Dawson

No

Etc/GMT+11

Yes

America/Dawson_Creek

No

Etc/GMT+12

Yes

America/Denver

Yes

Etc/GMT+2

Yes

America/Detroit

Yes

Etc/GMT+3

Yes

America/Edmonton

Yes

Etc/GMT+4

Yes

America/El_Salvador

No

Etc/GMT+5

Yes

America/Ensenada

Yes

Etc/GMT+6

Yes

America/Fort_Wayne

Yes

Etc/GMT+7

Yes

America/Fortaleza

No

Etc/GMT+8

Yes

America/Godthab

No

Etc/GMT+9

Yes

America/Goose_Bay

No

Etc/GMT-0

Yes

America/Grand_Turk

No

Etc/GMT-1

Yes

America/Guadeloupe

No

Etc/GMT-10

Yes

America/Guatemala

No

Etc/GMT-11

Yes

America/Guayaquil

No

-

-

America/Halifax

Yes

Etc/GMT-12

Yes

America/Havana

Yes

Etc/GMT-13

Yes

America/Indiana/Indianapolis

Yes

Etc/GMT-2

Yes

America/Indiana/Knox

No

Etc/GMT-3

Yes

America/Indiana/Marengo

No

Etc/GMT-4

Yes

America/Indiana/Vevay

No

Etc/GMT-5

Yes

America/Indianapolis

Yes

Etc/GMT-6

Yes

America/Inuvik

No

Etc/GMT-7

Yes

America/Iqaluit

No

Etc/GMT-8

Yes

America/Jamaica

Yes

Etc/GMT-9

Yes

America/Juneau

No

Etc/GMT0

Yes

America/Knox_IN

No

Etc/Greenwich

Yes

America/La_Paz

No

Europe/Amsterdam

No

America/Lima

No

Europe/Athens

No

America/Los_Angeles

Yes

Europe/Belfast

No

America/Louisville

No

Europe/Belgrade

No

America/Maceio

No

Europe/Berlin

No

America/Managua

No

Europe/Bratislava

No

America/Manaus

Yes

Europe/Brussels

No

America/Martinique

No

Europe/Bucharest

No

America/Mazatlan

Yes

Europe/Budapest

No

America/Mexico_City

Yes

Europe/Copenhagen

No

America/Miquelon

No

Europe/Dublin

Yes

America/Montevideo

No

Europe/Gibraltar

No

America/Montreal

Yes

Europe/Helsinki

No

America/Montserrat

No

Europe/Istanbul

Yes

America/New_York

Yes

Europe/Kaliningrad

No

America/Nome

No

Europe/Kiev

No

America/Noronha

Yes

Europe/Lisbon

Yes

America/Panama

No

Europe/Ljubljana

No

America/Phoenix

Yes

Europe/London

Yes

America/Porto_Acre

No

Europe/Luxembourg

No

America/Porto_Velho

No

Europe/Madrid

No

America/Puerto_Rico

No

Europe/Minsk

No

America/Rankin_Inlet

No

Europe/Monaco

No

America/Regina

Yes

Europe/Moscow

Yes

America/Rio_Branco

Yes

-

-

America/Santiago

Yes

Europe/Oslo

No

America/Sao_Paulo

Yes

Europe/Paris

No

America/Scoresbysund

No

Europe/Prague

No

America/Shiprock

Yes

Europe/Riga

No

America/St_Johns

Yes

Europe/Rome

No

America/St_Thomas

No

Europe/Samara

No

America/Swift_Current

No

Europe/San_Marino

No

America/Tegucigalpa

No

Europe/Sarajevo

No

America/Thule

No

Europe/Simferopol

No

America/Thunder_Bay

No

Europe/Skopje

No

America/Tijuana

Yes

Europe/Sofia

No

America/Tortola

No

Europe/Stockholm

No

America/Vancouver

Yes

Europe/Tallinn

No

America/Virgin

No

Europe/Tirane

No

America/Whitehorse

Yes

Europe/Vatican

No

America/Winnipeg

Yes

Europe/Vienna

No

America/Yellowknife

No

Europe/Vilnius

No

Arctic/Longyearbyen

No

Europe/Warsaw

Yes

Asia/Aden

No

Europe/Zagreb

No

Asia/Almaty

No

Europe/Zurich

No

Asia/Amman

No

GB

Yes

Asia/Anadyr

No

GB-Eire

Yes

Asia/Aqtau

No

GMT

Yes

Asia/Aqtobe

No

GMT+0

Yes

Asia/Baghdad

No

GMT-0

Yes

Asia/Bahrain

No

GMT0

Yes

Asia/Baku

No

Greenwich

Yes

Asia/Bangkok

No

HST

Yes

Asia/Beirut

No

Hongkong

Yes

Asia/Bishkek

No

Iceland

Yes

Asia/Calcutta

Yes

Indian/Chagos

No

Asia/Chongqing

No

-

-

Asia/Chungking

No

Indian/Christmas

No

Asia/Dacca

No

Indian/Cocos

No

Asia/Damascus

No

Indian/Mayotte

No

Asia/Dhaka

No

-

-

Asia/Dubai

No

Indian/Reunion

No

Asia/Gaza

No

Iran

Yes

Asia/Harbin

No

Israel

Yes

Asia/Hong_Kong

Yes

Jamaica

Yes

Asia/Irkutsk

No

Japan

Yes

Asia/Istanbul

Yes

Kwajalein

Yes

Asia/Jakarta

No

Libya

Yes

Asia/Jayapura

No

MET

Yes

Asia/Jerusalem

Yes

MST

Yes

Asia/Kabul

No

MST7MDT

Yes

Asia/Kamchatka

No

Mexico/BajaNorte

Yes

Asia/Karachi

No

Mexico/BajaSur

Yes

Asia/Kashgar

No

Mexico/General

Yes

Asia/Krasnoyarsk

No

NZ

Yes

Asia/Kuala_Lumpur

No

NZ-CHAT

Yes

Asia/Kuching

No

Navajo

Yes

Asia/Kuwait

No

PRC

Yes

Asia/Macao

No

PST

Yes

Asia/Macau

No

-

-

Asia/Magadan

No

PST8PDT

Yes

Asia/Manila

No

Pacific/Auckland

Yes

Asia/Muscat

No

Pacific/Chatham

Yes

Asia/Nicosia

No

Pacific/Easter

Yes

Asia/Novosibirsk

No

Pacific/Fakaofo

No

Asia/Omsk

No

Pacific/Fiji

No

Asia/Qatar

No

Pacific/Gambier

No

Asia/Rangoon

No

Pacific/Guam

No

Asia/Riyadh

Yes

Pacific/Honolulu

Yes

Asia/Saigon

No

Pacific/Johnston

No

Asia/Seoul

Yes

Pacific/Kiritimati

No

Asia/Shanghai

Yes

Pacific/Kwajalein

Yes

Asia/Singapore

Yes

Pacific/Marquesas

No

Asia/Taipei

Yes

Pacific/Midway

No

Asia/Tashkent

No

Pacific/Niue

No

Asia/Tbilisi

No

Pacific/Norfolk

No

Asia/Tehran

Yes

Pacific/Noumea

No

Asia/Tel_Aviv

Yes

Pacific/Pago_Pago

Yes

Asia/Tokyo

Yes

Pacific/Pitcairn

No

Asia/Ujung_Pandang

No

Pacific/Rarotonga

No

Asia/Urumqi

No

Pacific/Saipan

No

Asia/Vladivostok

No

Pacific/Samoa

Yes

Asia/Yakutsk

No

Pacific/Tahiti

No

Asia/Yekaterinburg

No

Pacific/Tongatapu

No

Asia/Yerevan

No

Pacific/Wake

No

Atlantic/Azores

No

Pacific/Wallis

No

Atlantic/Bermuda

No

Poland

Yes

Atlantic/Canary

No

Portugal

Yes

Atlantic/Faeroe

No

ROC

Yes

Atlantic/Madeira

No

ROK

Yes

Atlantic/Reykjavik

Yes

Singapore

Yes

Atlantic/St_Helena

No

Turkey

Yes

Atlantic/Stanley

No

US/Alaska

Yes

Australia/ACT

Yes

US/Aleutian

Yes

Australia/Adelaide

Yes

US/Arizona

Yes

Australia/Brisbane

Yes

US/Central

Yes

Australia/Broken_Hill

Yes

US/East-Indiana

Yes

Australia/Canberra

Yes

US/Eastern

Yes

Australia/Darwin

Yes

US/Hawaii

Yes

Australia/Hobart

Yes

US/Indiana-Starke

No

Australia/LHI

Yes

US/Michigan

Yes

Australia/Lindeman

Yes

US/Mountain

Yes

Australia/Lord_Howe

Yes

US/Pacific

Yes

Australia/Melbourne

Yes

US/Pacific-New

Yes

Australia/NSW

Yes

US/Samoa

Yes

Australia/North

Yes

UTC

No

-

-

W-SU

Yes

-

-

WET

Yes


Obsolete Locale Data

This section contains information about obsolete linguistic sorts, character sets, languages, and territories. The obsolete linguistic sort, language, and territory definitions are still available. However, they are supported for backward compatibility only; they may be desupported in a future release. You can obtain a listing of the obsolete character sets, languages, territories, and linguistic sorts for the current database release by querying the V$NLS_VALID_VALUES view.

Obsolete Linguistic Sorts

Table A-18 contains linguistic sorts that have been desupported in Oracle Database 10g.

Table A-18 Obsolete Linguistic Sorts in Oracle Database 10g

Obsolete Sort Name Replacement Sort

THAI_TELEPHONE

THAI_M

THAI_DICTIONARY

THAI_M

CANADIAN FRENCH

CANADIAN_M

JAPANESE

JAPANESE_M


Obsolete Territories

Table A-19 contains territories that have been desupported in Oracle Database 10g.

Table A-19 Obsolete Territories

Obsolete Territory Name Replacement Territory

CIS

RUSSIA

MACEDONIA

FYR MACEDONIA

YUGOSLAVIA

SERBIA AND MONTENEGRO

CZECHOSLOVAKIA

CZECH REPUBLIC or SLOVAKIA


Obsolete Languages

Table A-20 contains languages that have been desupported in Oracle Database 10g.

Table A-20 Obsolete Languages

Obsolete Language Name Replacement Language

BENGALI

BANGLA


New Names for Obsolete Character Sets

Table A-21 lists the obsolete character sets. If you reference any of these character sets in your code, then replace them with their new name.

Table A-21 New Names for Obsolete Character Sets

Old Name New Name

AL24UTFSS

UTF8, AL32UTF8

AR8MSAWIN

AR8MSWIN1256

CL8EBCDIC875S

CL8EBCDIC875R

CL8MSWINDOW31

CL8MSWIN1251

EL8EBCDIC875S

EL8EBCDIC875R

JVMS

JA16VMS

JEUC

JA16EUC

SJIS

JA16SJIS

JDBCS

JA16DBCS

KSC5601

KO16KSC5601

KDBCS

KO16DBCS

CGB2312-80

ZHS16CGB231280

CNS 11643-86

ZHT32EUC

JA16EUCFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHS32EUCFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHS16GBKFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

JA16DBCSFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

KO16DBCSFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHS16DBCSFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHS16CGB231280FIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHT16DBCSFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

KO16KSC5601FIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

JA16SJISFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHT16BIG5FIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHT32TRISFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.


AL24UTFFSS Character Set Desupported

The Unicode Character Set AL24UTFFSS was desupported in Oracle9i. AL24UTFFSS was introduced in version 7 as the Unicode character set supporting UTF-8 encoding scheme based on the Unicode standard 1.1, which is now obsolete. In Oracle Database 10g, Oracle offers the Unicode database character set AL32UTF8, which is based on Unicode 4.0, and UTF8, which is based on Unicode 3.0.

The migration path for an existing AL24UTFFSS database is to upgrade to UTF8 prior to upgrading to Oracle9i. As with all migrations to a new database character set, Oracle Corporation recommends that you use the Character Set Scanner for data analysis before attempting to migrate your existing database character set to UTF8.

Updates to the Oracle Language and Territory Definition Files

Changes have been made to the content in some of the language and territory definition files in Oracle Database 10g. These updates are necessary to correct the legacy definitions which no longer meet the local conventions in some of the Oracle supported languages and territories. These changes include modifications to the currency symbols, month names, and group separators. One example is the local currency symbol for Brazil. This has been updated from Cr$ to R$ in Oracle Database 10g.

Please refer to the "Oracle Language and Territory definition changes" table documented in the $ORACLE_HOME/nls/data/old/data_changes.html file for a detailed list of the changes.

Oracle Database 10g customers should review their existing application code to make sure that the correct cultural conventions that are defined in Oracle Database 10g are being used. For customers who may not be able to make the necessary code changes to support their applications, Oracle offers Oracle9i locale definition files with Oracle Database 10g.

To revert back to the Oracle9i language and territory behavior, perform the following:

  1. Shutdown the database.

  2. Run the script cr9idata.pl from the $ORACLE_HOME/nls/data/old directory.

  3. Set the ORA_NLS10 environment variable to the newly created $ORACLE_HOME/nls/data/9idata directory.

  4. Restart the database.

Steps 2 and 3 will need to be repeated for all 10g database clients that need to revert back to the Oracle9i definition files.

Oracle Corporation strongly recommends that customers use the Oracle Database 10g locale definition files; Oracle9i locale definition files will be desupported in a future release.