latex2rtf | RTF Index | Next Section

Far East Support

Word 2000 contains full support for all Far East features introduced in all previous Asian versions of Word, and so it has the ability to read and write RTF keywords related to such features. All of them should be described in the appropriate sub-section of the Contents of an RTF File section, but details specific to that handling of Far East characters are given here.

Escaped Expressions

An escape expression (for example, \'hh, \\, or \{) is usable in all RTF control words.

Writer:

In general RTF should be written out with all characters above 0x80 in the escaped form, \'hh.

Character code Write out as
0x00 <= ch < 0x20 Escaped (\'hh)
0x20 <= ch < 0x80 Raw (non-escaped) character
0x80 <= ch <= 0xFF Escaped (\'hh)
0x5C, 0x7B, 0x7D (special RTF characters \,{, or }) Escaped (\'hh)

Reader:

When the RTF reader encounters raw characters in the leading-byte range of the double-byte character, it regards the next character as the trailing byte of the double-byte character and combines the two characters into one double-byte character.

Leading byte Trailing byte Validity
Escaped Raw (0x20 <= ch <= 0x7f) Valid (standard format for double-byte character)
Escaped Escaped (other) Valid (standard format for double-byte character)
Raw Raw Valid (RTF-J format for double-byte character)
Raw Escaped Invalid
Note†††Characters that are special RTF symbols (\,{, or }) should always be escaped, preferably using the \'hh syntax.

Character Set

Word J specifies the character set in the font table using \fcharset. Word J interprets \cpg437 as \fcharset0 and \cpg932 as \fcharset128 if it encounters these control words when reading RTF. If both \fcharset and \cpg appear in the font table, \cpg is ignored.

Character Mapping

Word maps single-byte characters according to character set information (for example, Macintosh to ANSI) and leaves double-byte characters unmapped.

Font Family

RTF-J
control words
Definition and the interpretation in Word
\jis RTF-J uses \jis as a control word for character set. Word J interprets this as \ansi, which is the default character set used if the character set is not defined.
\fjminchou and \fjgothic RTF-J uses \fjminchou and \fjgothic to specify font family. Word J interprets these as \fnil, which is the default font family.

ShiftJIS Font Without \cpg or \fcharset

If \cpg or \fcharset control words are not present, Word J uses the text metrics of the font before determining the character set of these fonts. If the font is unknown, Word J assumes it is SHIFTJIS_CHARSET.

Composite Fonts (Associated Fonts for International Runs)

Word J defines control words to specify composite fonts as associated character properties. These control words follow the rule of associated character properties and understand font designation (\af). All other <aprops> are ignored in Word J.

<atext> <losbrun> | <hisbrun> | <dbrun>
<losbrun> \hich \af & <aprops> \dbch \af & <aprops> \loch <ptext>
<hisbrun> \loch \af & <aprops> \dbch \af & <aprops> \hich <ptext>
<dbrun> \loch \af & <aprops> \hich \af & <aprops> \dbch <ptext>
Control Word Definition
\loch Specifies a run of the characters in the low-ANSI (0x00-0x7F) area.
\hich For the characters in the high-ANSI (0x80-0xFF) area.
\dbch Specifies a run of the double-byte characters.

Word J writes out associated character properties in the styles. In the style sheet, the <dbrun> definition should be used for compatibility with applications that have transparent readers.

{\stylesheet{\loch\af5\hich\af5\dbch\f27\fs20\snext0 Normal;}}

If the composite font definition matches the style, only the control word (\loch, \hich, or \dbch) will be used to distinguish the type of run, together with the font information for transparent readers.

{\fonttbl{\f5\fswiss\fcharset0\fprq2 Arial;}{\f27\froman\fcharset128\fprq1 Mincho;}}
{\stylesheet{\loch\af5\hich\af5\dbch\f27\fs20\snext0 Normal;}}
\pard\plain
{\dbch\f27\fs20 \'82\'b1\'82\'ea\'82\'cd}
{\loch\f5 Test }
{\dbch\f27\'82\'c5\'82\'b7\'81B}
\par}

If one or all of \loch, \hich, and \dbch are missing from the style sheet definition (or the character set doesn't match), Word J will apply appropriate fonts to each character run in the style using the bulleted rules below.

Control
Word
Font that Word J will apply
\loch Same font as \f.
\hich Any font whose character set is ANSI_CHARSET.
\dbch Any font whose character set is SHIFTJIS_CHARSET.

If the composite font control words are missing from the character run, Word J will interpret all characters below 0x80 as a \loch run. Characters above or equal to 0x80 will be determined using the following rules:

New Far East Control Words Created by Word 6J

Control
Word
Description
Associated Character Properties
\loch The text consists of single-byte low-ANSI (0x00-0x7F) characters.
\hich The text consists of single-byte high-ANSI (0x80-0xFF) characters.
\dbch The text consists of double-byte characters.
Borders
\brdrdash Dashed border.
\brdrdashd Dash-dotted border.
\brdrdashdd Dash-dot-dotted border.
Character Properties
\uldash Dashed underline.
\uldashd Dash-dotted underline.
\uldashdd Dash-dot-dotted underline.
\ulhair Hairline underline.
\ulth Thick underline.
\ulwave Wave underline.
\accnone No accent characters (over dot / over comma).
\accdot Over dot accent.
\acccomma Over comma accent.
\charscalex Character width scaling.
\striked1 Double strikethrough. \striked0 turns it off.
Document Formatting Properties
\horzdoc Horizontal rendering.
\vertdoc Vertical rendering.
\*\fchars List of following kinsoku characters.
\*\lchars List of leading kinsoku characters.
\jcompress Compressing justification (default).
\jexpand Expanding justification.
\gutterprl Parallel gutter.
\dgsnap Snap to drawing grid.
\dghspaceN Drawing grid horizontal spacing in twips (the default is 120).
\dgvspaceN Drawing grid vertical spacing in twips (the default is 120).
\dghoriginN Drawing grid horizontal origin in twips (the default is 1701).
\dgvoriginN Drawing grid vertical origin in twips (the default is 1984).
\dghshowN Show Nth horizontal drawing gridline (the default is 3).
\dgvshowN Show Nth vertical drawing gridline (the default is 0).
\twoonone Print two logical pages on one physical page.
\lnongrid Define line based on the grid.
Bullets and Numbering
\pndecd Double-byte decimal numbering (\*arabic\*dbchar).
\pndbnum Kanji numbering without the digit character (\*dbnum1).
\pnaiu 46 phonetic katakana characters in "aiueo" order (\*aiueo).
\pnaiud 46 phonetic double-byte katakana characters (\*aiueo\*dbchar).
\pniroha 46 phonetic katakana characters in "iroha" order (\*iroha).
\pnirohad 46 phonetic double-byte katakana characters (\*iroha\*dbchar).
\pncnum 20 numbered list in circle (\*circlenum).
\pnuldash Dashed underline.
\pnuldashd Dash-dotted underline.
\pnuldashdd Dash-dot-dotted underline.
\pnulhair Hairline underline.
\pnulth Thick underline.
\pnulwave Wave underline.
Drawing Objects
\dptxlrtb Text box flows from left to right and top to bottom (default).
\dptxtbrl Text box flows from right to left and top to bottom.
\dptxbtlr Text box flows from left to right and bottom to top.
\dptxlrtbv Text box flows from left to right and top to bottom, vertically.
\dptxtbrlv Text box flows from top to bottom and right to left, vertically.
Frame Properties
\frmtxlrtb Frame box flows from left to right and top to bottom (default).
\frmtxtbrl Frame box flows right to left and top to bottom.
\frmtxbtlr Frame box flows left to right and bottom to top.
\frmtxlrtbv Frame box flows left to right and top to bottom, vertical.
\frmtxtbrlv Frame box flows top to bottom and right to left, vertical.
Index Entries
\*\pxe "Yomi" (pronunciation) for index entry.
Paragraph Properties
\nocwrap No character wrapping.
\nowwrap No word wrapping.
\qd Distributed.
\nooverflow No overflow period and comma.
\aspalpha Auto spacing between DBC and English.
\aspnum Auto spacing between DBC and numbers.
\fahang Font alignment → Hanging.
\facenter Font alignment → Center.
\faroman Font alignment → Roman (default).
\favar Font alignment → Upholding variable.
\fafixed Font alignment ? Upholding fixed.
Section Formatting Properties
\horzsect Horizontal rendering.
\vertsect Vertical rendering.
\pgndecd Double-byte decimal numbering.
\pgndbnum Kanji numbering without the digit character.
\pgndbnumd Kanji numbering with the digit character.
Special Characters
\zwbo Zero-width break opportunity. Used to insert break opportunity between two characters.
\zwnbo Zero-width nonbreak opportunity. Used to remove break opportunity between two characters.
\qmspace One-quarter em space.
Table Formatting
\cldglu Diagonal line (top left to bottom right). Followed by <brdr>, which defines the properties of the diagonal border (\cldglu <brdr>).
\cldgll Diagonal line (top right to bottom left). Followed by <brdr>, which defines the properties of the diagonal border (\cldgll <brdr>).
\cltxlrtb Text in a cell flows from left to right and top to bottom (default).
\cltxtbrl Text in a cell flows right to left and top to bottom.
\cltxbtlr Text in a cell flows left to right and bottom to top.
\cltxlrtbv Text in a cell flows left to right and top to bottom, vertical.
\cltxtbrlv Text in a cell flows top to bottom and right to left, vertical.
\clvmgf The first cell in a range of table cells to be vertically merged.
\clvmrg Contents of the table cell are vertically merged with those of the preceding cell.
\clvertalt Cell top align.
\clvertalc Cell vertically center align.
\clvertalb Cell bottom align.
Tabs
\tlmdot Leader middle dots.

New Far East Control Words Created by Asian Versions of Word 97

Control
Word
Meaning
Character Formatting Properties
\cgridN Character grid.
\g Destination related to character grids.
\gcw Grid column width.
\gridtbl Destination keyword related to character grids.
\nosectexpand Disable character space basement.
Paragraph Formatting Properties
\adjustright Automatically adjust right indent when document grid is defined.
\nosnaplinegrid Disable snap line to grid.
\faauto Font alignment the default setting for this is "Auto."
Borders
\brdrframe Border resembles a frame
Bullets and Numbers
\pnaiueo 46 phonetic Katakana characters in "aiueo" order (*aiueo).
\pnaiueod 46 phonetic double-byte Katakana characters (*aiueo*dbchar).
\pndbnumd Kanji numbering with the digit character (*dbnum2).
\pndbnumt Kanji numbering 3 (*dbnum3).
\pndbnuml Kanji numbering 3 (*dbnum3).
\pndbnumk Kanji numbering 4 (*dbnum4).
\pnganada Korean numbering 2 (*ganada).
\pngbnum Chinese numbering 1 (*gb1).
\pngbnumd Chinese numbering 2 (*gb2).
\pngbnuml Chinese numbering 3 (*gb3).
\pngbnumk Chinese numbering 4 (*gb4).
\pnzodiac Chinese Zodiac numbering 1 (*zodiac1).
\pnzodiacd Chinese Zodiac numbering 2 (*zodiac2).
\pnzodiacl Chinese Zodiac numbering 3 (*zodiac3).
\pnganada Korean numbering 1 (*ganada).
\pnchosung Korean numbering 2 (*chosung).
Endnotes and Footnotes
\ftnnchosung Footnote Korean numbering 1 (*chosung).
\ftnncnum Footnote Circle numbering (*circlenum).
\ftnndbnum Foonote Kanji numbering without the digit character (*dbnum1).
\ftnndbnumd Footnote Kanji numbering with the digit character (*dbnum2).
\ftnndbnumt Footnote Kanji numbering 3 (*dbnum3).
\ftnndbnumk Footnote Kanji numbering 4 (*dbnum4).
\ftnndbar Footnote double-byte numbering (*dbchar).
\ftnnganada Footnote Korean numbering 2 (*ganada).
\ftnngbnum Footnote Chinese numbering 1 (*gb1).
\ftnngbnumd Footnote Chinese numbering 2 (*gb2).
\ftnngbnuml Footnote Chinese numbering 3 (*gb3).
\ftnngbnumk Footnote Chinese numbering 4 (*gb4).
\ftnnzodiac Footnote numbering— Chinese Zodiac numbering 1 (* zodiac1)
\ftnnzodiacd Footnote numbering— Chinese Zodiac numbering 2 (* zodiac2)
\ftnnzodiacl Footnote numbering— Chinese Zodiac numbering 3 (* zodiac3).
\aftnnchosung Endnote Korean numbering 1 (*chosung).
\aftnncnum Endnote Circle numbering (*circlenum).
\aftnndbnum Endnote Kanji numbering without the digit character (*dbnum1).
\aftnndbnumd Endnote Kanji numbering with the digit character (*dbnum2).
\aftnndbnumt Endnote Kanji numbering 3 (*dbnum3).
\aftnndbnumk Endnote Kanji numbering 4 (*dbnum4).
\aftnndbar Endnote double-byte numbering (*dbchar).
\aftnnganada Endnote Korean numbering 2 (*ganada).
\aftnngbnum Endnote Chinese numbering 1 (*gb1).
\aftnngbnumd Endnote Chinese numbering 2 (*gb2).
\aftnngbnuml Endnote Chinese numbering 3 (*gb3).
\aftnngbnumk Endnote Chinese numbering 4 (*gb4).
\aftnnzodiac Endnote numbering— Chinese Zodiac numbering 1 (* zodiac1)
\aftnnzodiacd Endnote numbering— Chinese Zodiac numbering 2 (* zodiac2)
\aftnnzodiacl Endnote numbering— Chinese Zodiac numbering 3 (* zodiac3).
Section Formatting Properties
\pgnchosung Korean numbering 1 (* chosung).
\pgncnum Circle numbering (*circlenum).
\pgndbnumt Kanji numbering 3 (*dbnum3).
\pgndbnumk Kanji numbering 4 (*dbnum4).
\pgnganada Korean numbering 2 (*ganada)
\pgngbnum Chinese numbering 1 (*gb1).
\pgngbnumd Chinese numbering 2 (*gb2).
\pgngbnuml Chinese numbering 3 (*gb3).
\pgngbnumk Chinese numbering 4 (*gb4).
\pgnzodiac Chinese Zodiac numbering 1 (*zodiac1).
\pgnzodiacd Chinese Zodiac numbering 2 (*zodiac2).
\pgnzodiacl Chinese Zodiac numbering 3 (*zodiac3).
\sectexpandN Character space basement (character pitch minus font size) N in device independent units (a device independent unit is 1/294912th of an inch).
\sectlinegridN Line grid, where N is the line pitch in 20ths of a point.
\sectdefaultcl Default state of section. Indicates \sectspecifycl and \sectspecifyl are not emitted.
\sectspecifycl Specify number of characters per line only.
\sectspecifyl Specify both number of characters per line and number of lines per page.
Document Formatting Properties
\dgmargin Grid to follow margins.
Index Entries
\yxe Pronunciation (or heading) for index entry, used in phonetic sorting.

New Far East Control Words Created by Word 2000

Document Formatting Properties
\jsksu Indicates that the strict Kinsoku set must be used for Japanese; \jsku should not be present if \ksulangN is present and the language N is Japanese.
\ksulangN Indicates what language N the customized kinsoku characters defined in the \fchars and \lchars destinations belong to.
Section Formatting Properties
\sectspecifygenN Indicates that text should snap to the character grid. Note that the N is part of the keyword.
Paragraph Formatting Properties
\cufiN First-line indent in hundredths of a character unit; overrides \fiN although they should both be emitted with equivalent values.
\culiN Left indent (space before) in character units. Behaves like \linN and overrides \liN and \linN although they should all be emitted with equivalent values.
\curiN Right indent (space after) in character units. Behaves like \rinN and overrides \riN and \rinN although they should all be emitted with equivalent values.
\lisbN Space before in hundredths of a character unit. Overrides \sbN although they should both be emitted with equivalent values.
\lisaN Space after in hundredths of a character unit. Overrides \saN although they should both be emitted with equivalent values.
Character Formatting Properties
\horzvertN Text in the group flows in a direction opposite to that of the main document (Horizontal in vertical and vertical in horizontal):

0 — Switched text is uncompressed.

1 — Switched text is compressed to current line height.

\twoinoneN Text in the group is displayed as two half-height lines within a line.

0 — Text is not enclosed.

1 — Text is enclosed in parentheses.

2 — Text is enclosed in square brackets ([]).

3 — Text is enclosed in angled brackets (<>).

4 — Text is enclosed in braces ({}).

\fittextN Fit the text in the current group in N twips. When N is set to -1 (\fittext-1) it indicates a continuation of the previous \fittextN run. In other words {\fittext1000 Fit this} {\fittext-1 text} fits the string "Fit this text" in 1000 twips.

latex2rtf | RTF Index | Next Section