jiskanji(5)jiskanji(5)NAME
jiskanji, jiskanji7, JIS7 - A character encoding system (codeset) for
Japanese
DESCRIPTION
JIS Kanji is a codeset that uses the JIS X0202 symbol extension method
for encoding the JIS X0208 and JIS X0201 character sets. There are two
types of JIS Kanji encoding: 7-bit JIS Kanji code and 8-bit JIS Kanji
code.
7-bit JIS Kanji Code
In 7-bit JIS Kanji encoding, all character values are 7-bit bytes.
Characters are interpreted according to preceding in and out sequences
as follows: Kanji in sequence (ESC $ B)
The code values following the Kanji in sequence (ESC $ B) are
treated as characters in the JIS X0208 Kanji character set.
Kanji out sequence (ESC ( B)
The code values following the Kanji out sequence (ESC ( B) are
treated as ASCII characters. Supplementary Kanji in sequence
(ESC $ ( D)
The code values following the supplementary Kanji in sequence
(ESC $ ( D) are treated as characters in the JIS X0212 supple‐
mentary Kanji character set. User-Defined Character (UDC) in
sequence (ESC $ ( 0)
The code values following the UDC in sequence (ESC $ ( 0) are
treated as characters in the vendor-defined or user-defined
character set. Kana in (SO) and Kana out (SI) sequences
The code values following SO(0x0e) and preceding SI(0x0f) are
treated as characters in the JIS X0201 Katakana character set.
Katakana in sequence (ESC ( I)
Code values following the Katakana in sequence (ESC ( I) are
treated as characters in the JIS X0201 Katakana character set.
In this case, the Kanji out sequence is used to switch back to
ASCII code.
The Katakana in and Kanji out sequences are an alternative to
using the Kana in and out sequences (SO/SI).
8-bit JIS Kanji Code
In 8-bit JIS Kanji encoding, the JIS X0201 Katakana characters are rep‐
resented as 8-bit bytes. Using this form of encoding, in and out
sequences have the following effect: Kanji in sequence (ESC $ B)
Code values following the Kanji in sequence (ESC $ B) are
treated as characters in the JIS X0208 Kanji character set.
Supplementary Kanji in sequence (ESC $ ( D)
Code values following the supplementary Kanji in sequence (ESC $
( D) are treated as characters in the JIS X0212 supplementary
Kanji character set. User-Defined Character (UDC) in sequence
(ESC $ ( 0)
Code values following the UDC in sequence (ESC $ ( 0) are
treated as vendor-defined or user-defined characters. Kanji out
sequence (ESC ( B) Code values following the Kanji out sequence
(ESC ( B) are treated as ASCII characters. Kana in and out
sequences (SI/SO)
These sequences are ignored.
Codeset Conversion
The following codeset converter pairs are available for converting Ja‐
panese characters between jiskanji7 or JIS7 and other encoding formats.
The RESTRICTIONS section discusses some conversion limitations that
apply to these converters.
Refer to iconv_intro(5) for an introduction to codeset conversion. For
more information about the other codeset for which jiskanji7 or JIS7 is
the input or output, see the reference page specified in the list item.
deckanji_jiskanji7 or deckanji_JIS7, jiskanji7_deckanji or JIS7_deck‐
anji
Converting from and to the DEC Kanji codeset: deckanji(5).
eucJP_jiskanji7 or eucJP_JIS7, jiskanji7_eucJP or JIS7_eucJP
Converting from and to Japanese Extended UNIX Code: eucJP(5).
eucTW_jiskanji7 or eucTW_JIS7, jiskanji7_eucTW or JIS7_eucTW
Converting from and to Taiwanese Extended UNIX Code: eucTW(5).
sdeckanji_jiskanji7 or sdeckanji_JIS7, jiskanji7_sdeckanji or
JIS7_sdeckanji
Converting from and to the Super DEC Kanji codeset: sdeck‐
anji(5). SJIS_jiskanji7 or SJIS_JIS7, jiskanji7_SJIS or
JIS7_SJIS
Converting from and to Shift JIS format: SJIS(5).
Shift JIS encoding format is identical to encoding in Microsoft
code-pages used on PC systems. Therefore, you can use these con‐
verters to convert Japanese characters between JIS Kanji and PC
code-page format. For general information on how the operating
system supports PC code pages, see code_page(5).
RESTRICTIONS
The JIS Kanji codeset is not supported directly by a locale but through
code conversion (through the iconv utility, Japanese terminal (tty)
code conversion, and so forth).
In the codeset naming conventions used by the iconv utility, the string
JIS7 indicates 7-bit JIS Kanji code that follows a Katakana in sequence
and the string jiskanji7 indicates 7-bit JIS Kanji code entered between
Kana in and out sequences. The following sequences are valid for input
to the iconv utility but are not generated when code is converted to
jiskanji7: Kanji in (ESC $ @) Kanji in (ESC & @ ESC $ B) Kanji in (ESC
$ ( B) Kanji in (ESC $ ( @) Supplementary Kanji in (ESC $ D) Kana in
(ESC ( J) Kana in (ESC ( H)
In the code naming conventions of the Japanese terminal, the string
jis7 indicates 7-bit JIS Kanji code and the string jis8 indicates 8-bit
JIS Kanji code. When the terminal code is set to jis7, the Kana in and
out sequences (SI/SO) are used for JIS X0201 Katakana character repre‐
sentation.
SEE ALSO
Commands: locale(1)
Others: ascii(5), code_page(5), deckanji(5), eucJP(5), i18n_intro(5),
i18n_printing(5), iconv_intro(5), iso2022jp(5), Japanese(5),
l10n_intro(5), sdeckanji(5), shiftjis(5)jiskanji(5)