Home > Storage > PowerScale (Isilon) > Product Documentation > Storage (general) > PowerScale OneFS: Long Filename Support > Overview
OneFS supports a range of character encodings that accommodate various global languages and dialects. The following table lists and describes the available options:
Encoding scheme | Description |
UTF-8 | Default. UTF-8 is a variable-width character encoding that is used for electronic communication. Defined by the Unicode Standard, the name is derived from Unicode Transformation Format – 8-bit. UTF-8 can encode all 1,112,064 valid character code points in Unicode using one to four one-byte code units. |
Windows-SJIS | Shift JIS (Shift Japanese Industrial Standards, also SJIS, MIME name Shift_JIS). A character encoding for the Japanese language. |
Windows-949 | Unified Hangul Code, or Extended Wansung, also known under Microsoft Windows as Code Page 949. The Microsoft Windows encoding for the Korean language. It is an extension of Wansung Code to include all 11172 Hangul syllables present in Johab. |
Windows-1252 | Also called CP-1252. A single-byte character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows for English and many European languages including Spanish, French, and German. |
EUC-KR | Variable-width encoding to represent Korean text using two coded character sets, KS X 1001 (formerly KS C 5601) and either ISO 646:KR (KS X 1003, formerly KS C 5636) or US-ASCII, depending on variant. KS X 2901 (formerly KS C 5861) stipulates the encoding and RFC 1557 dubbed it as EUC-KR. |
EUC-JP | Multibyte, variable-width character encoding system used primarily for Japanese, Korean, and Simplified Chinese. |
EUC-JP-MS | Microsoft compatible encodings for Japanese. |
UTF-8-MAC | A variant UTF-8 encoding used by Apple’s OSX file system |
ISO-8859-1 (Latin-1) | Western European language encoding. ISO 8859-1 encodes what it refers to as Latin alphabet number 1, consisting of 191 characters from the Latin script. |
ISO-8859-2 (Latin-2) | Non-Cyrillic Central and Eastern European language encoding. |
ISO-8859-3 (Latin-3) | Southern European languages including Turkish, Maltese, and Esperanto language encoding. |
ISO-8859-4 (Latin-4) | Estonian, Latvian, Lithuanian, Greenlandic, and Sami. |
ISO-8859-5 (Cyrillic) | Russian, Bulgarian, Belarusian, Macedonian, Serbian, and Ukrainian (partial) language encoding. |
ISO-8859-6 (Arabic) | Arabic language encoding. |
ISO-8859-7 (Greek) | Greek language encoding. |
ISO-8859-8 (Hebrew) | Hebrew language encoding. |
ISO-8859-9 (Latin-5) | Turkish language encoding. |
ISO-8859-10 (Latin-6) | A rearrangement of Latin-4. Considered more useful for Nordic languages. Baltic languages use Latin-4 more. |
ISO-8859-13 (Latin-7) | Added some characters for Baltic languages which were missing from Latin-4 and Latin-6. Related to Windows-1257. |
ISO-8859-14 (Latin-8) | Covers Celtic languages such as Gaelic and the Breton language. |
ISO-8859-15 (Latin-9) | A revision of 8859-1 that removes some little-used symbols, replacing them with the euro sign € and the letters Š, š, Ž, ž, Œ, œ, and Ÿ, which completes the coverage of French, Finnish and Estonian. |
ISO-8859-16 (Latin-10) | Albanian, Croatian, Hungarian, Italian, Polish, Romanian, and Slovene, but also Finnish, French, German, and Irish Gaelic (new orthography) language encoding. |
By default, OneFS automatically uses UTF-8 encoding, as defined by the Unicode standard. Conversion between encodings is accomplished in OneFS using an intermediate translation to UTF-8.
You can view and configure a cluster’s current OneFS global character encoding in the OneFS WebUI by going to File system > File system settings > Character encoding:
Also, you can view and manage a cluster’s character encoding from the OneFS CLI using the following command syntax:
Usage:
isi cluster encoding <action>
[--timeout <integer>]
[{--help | -h}]
Actions:
list List valid filesystem character encodings.
modify Modify the current filesystem character encoding.
view View the current filesystem character encoding.
For example:
# isi cluster encoding view
Current Encoding: UTF-8
Default Encoding: UTF-8
Alternatively, you can configure a cluster’s character encoding from the isi config CLI command subsystem, as follows:
# isi config
Welcome to the Isilon IQ configuration console.
Copyright (c) 2001-2021 Dell Inc. All Rights Reserved.
Enter 'help' to see list of available commands.
Enter 'help <command>' to see help for a specific command.
Enter 'quit' at any prompt to discard changes and exit.
Node build: Isilon OneFS 9.7.0.0
Node serial number: CF2Y5221200104
Cluster1 >>>
For example, the following command sets the encoding for the cluster to ISO-8859-1:
Cluster1 >>> encoding ISO-8859-1
Cluster1 >>> commit
Cluster1 >>> reboot
Warning: Character encoding is typically established during installation of the cluster. Modifying the character encoding setting after installation may render files unreadable when done incorrectly.
Only OneFS supported character sets are available for selection. UTF-8 is the default character set for OneFS nodes. If the cluster character encoding is not set to UTF-8, SMB share names are case-sensitive.