latex2rtf | RTF Index | Next Section

Far East Support

Word 2000 contains full support for all Far East features introduced in all previous Asian versions of Word, and so it has the ability to read and write RTF keywords related to such features. All of them should be described in the appropriate sub-section of the Contents of an RTF File section, but details specific to that handling of Far East characters are given here.

Escaped Expressions

An escape expression (for example, \'hh, \\, or \{) is usable in all RTF control words.

Writer:

In general RTF should be written out with all characters above 0x80 in the escaped form, \'hh.

Character code	Write out as
0x00 <= ch < 0x20	Escaped (\'hh)
0x20 <= ch < 0x80	Raw (non-escaped) character
0x80 <= ch <= 0xFF	Escaped (\'hh)
0x5C, 0x7B, 0x7D (special RTF characters \,{, or })	Escaped (\'hh)

Reader:

When the RTF reader encounters raw characters in the leading-byte range of the double-byte character, it regards the next character as the trailing byte of the double-byte character and combines the two characters into one double-byte character.

Leading byte	Trailing byte	Validity
Escaped	Raw (0x20 <= ch <= 0x7f)	Valid (standard format for double-byte character)
Escaped	Escaped (other)	Valid (standard format for double-byte character)
Raw	Raw	Valid (RTF-J format for double-byte character)
Raw	Escaped	Invalid

Note†††Characters that are special RTF symbols (\,{, or }) should always be escaped, preferably using the \'hh syntax.

Character Set

Word J specifies the character set in the font table using \fcharset. Word J interprets \cpg437 as \fcharset0 and \cpg932 as \fcharset128 if it encounters these control words when reading RTF. If both \fcharset and \cpg appear in the font table, \cpg is ignored.

Character Mapping

Word maps single-byte characters according to character set information (for example, Macintosh to ANSI) and leaves double-byte characters unmapped.

Font Family

RTF-J control words	Definition and the interpretation in Word
\jis	RTF-J uses `\jis` as a control word for character set. Word J interprets this as `\ansi`, which is the default character set used if the character set is not defined.
\fjminchou and \fjgothic	RTF-J uses `\fjminchou` and `\fjgothic` to specify font family. Word J interprets these as `\fnil`, which is the default font family.

ShiftJIS Font Without \cpg or \fcharset

If \cpg or \fcharset control words are not present, Word J uses the text metrics of the font before determining the character set of these fonts. If the font is unknown, Word J assumes it is SHIFTJIS_CHARSET.

Composite Fonts (Associated Fonts for International Runs)

Word J defines control words to specify composite fonts as associated character properties. These control words follow the rule of associated character properties and understand font designation (\af). All other <aprops> are ignored in Word J.

<atext>	<losbrun> \| <hisbrun> \| <dbrun>
<losbrun>	`\hich` *`\af`* & <aprops> `\dbch` *`\af`* & <aprops> `\loch` <ptext>
<hisbrun>	`\loch` *`\af`* & <aprops> `\dbch` *`\af`* & <aprops> `\hich` <ptext>
<dbrun>	`\loch` *`\af`* & <aprops> `\hich` *`\af`* & <aprops> `\dbch` <ptext>
Control Word	Definition
\loch	Specifies a run of the characters in the low-ANSI (0x00-0x7F) area.
\hich	For the characters in the high-ANSI (0x80-0xFF) area.
\dbch	Specifies a run of the double-byte characters.

Word J writes out associated character properties in the styles. In the style sheet, the <dbrun> definition should be used for compatibility with applications that have transparent readers.

{\stylesheet{\loch\af5\hich\af5\dbch\f27\fs20\snext0 Normal;}}

If the composite font definition matches the style, only the control word (\loch, \hich, or \dbch) will be used to distinguish the type of run, together with the font information for transparent readers.

{\fonttbl{\f5\fswiss\fcharset0\fprq2 Arial;}{\f27\froman\fcharset128\fprq1 Mincho;}}
{\stylesheet{\loch\af5\hich\af5\dbch\f27\fs20\snext0 Normal;}}
\pard\plain
{\dbch\f27\fs20 \'82\'b1\'82\'ea\'82\'cd}
{\loch\f5 Test }
{\dbch\f27\'82\'c5\'82\'b7\'81B}
\par}

If one or all of \loch, \hich, and \dbch are missing from the style sheet definition (or the character set doesn't match), Word J will apply appropriate fonts to each character run in the style using the bulleted rules below.

Control Word	Font that Word J will apply
\loch	Same font as \f.
\hich	Any font whose character set is ANSI_CHARSET.
\dbch	Any font whose character set is SHIFTJIS_CHARSET.

If the composite font control words are missing from the character run, Word J will interpret all characters below 0x80 as a \loch run. Characters above or equal to 0x80 will be determined using the following rules:

If the character is in the leading-byte range and the next character is in the trailing-byte range of a double-byte character, it will be treated as a \dbch run (one double-byte character). For example:
```
\'99\'47‡
```
If the character is in the leading-byte range of a double-byte character but the next character is not in the trailing-byte range, it will be treated as a \hich run (two high-ANSI or low-ANSI characters). For example:
```
\'99\'FF‡ÿ
```
If the character is in the leading-byte range of a double-byte character and is the last character in the run, it will be treated as a \hich run (one high-ANSI character). For example:
```
\'99\parà
```
If the character is not in the leading-byte range of a double-byte character, it will be treated as a \hich run (one high-ANSI character). For example:
```
\'FFàÿ
```

New Far East Control Words Created by Word 6J

Control Word	Description
Associated Character Properties
\loch	The text consists of single-byte low-ANSI (0x00-0x7F) characters.
\hich	The text consists of single-byte high-ANSI (0x80-0xFF) characters.
\dbch	The text consists of double-byte characters.
Borders
\brdrdash	Dashed border.
\brdrdashd	Dash-dotted border.
\brdrdashdd	Dash-dot-dotted border.
Character Properties
\uldash	Dashed underline.
\uldashd	Dash-dotted underline.
\uldashdd	Dash-dot-dotted underline.
\ulhair	Hairline underline.
\ulth	Thick underline.
\ulwave	Wave underline.
\accnone	No accent characters (over dot / over comma).
\accdot	Over dot accent.
\acccomma	Over comma accent.
\charscalex	Character width scaling.
\striked1	Double strikethrough. `\striked0` turns it off.
Document Formatting Properties
\horzdoc	Horizontal rendering.
\vertdoc	Vertical rendering.
*\\fchars**	List of following kinsoku characters.
*\\lchars**	List of leading kinsoku characters.
\jcompress	Compressing justification (default).
\jexpand	Expanding justification.
\gutterprl	Parallel gutter.
\dgsnap	Snap to drawing grid.
\dghspaceN	Drawing grid horizontal spacing in twips (the default is 120).
\dgvspaceN	Drawing grid vertical spacing in twips (the default is 120).
\dghoriginN	Drawing grid horizontal origin in twips (the default is 1701).
\dgvoriginN	Drawing grid vertical origin in twips (the default is 1984).
\dghshowN	Show Nth horizontal drawing gridline (the default is 3).
\dgvshowN	Show Nth vertical drawing gridline (the default is 0).
\twoonone	Print two logical pages on one physical page.
\lnongrid	Define line based on the grid.
Bullets and Numbering
\pndecd	Double-byte decimal numbering (*`\`arabic\dbchar*).
\pndbnum	Kanji numbering without the digit character (*`\`dbnum1**).
\pnaiu	46 phonetic katakana characters in "aiueo" order (*`\`aiueo**).
\pnaiud	46 phonetic double-byte katakana characters (*`\`aiueo\dbchar*).
\pniroha	46 phonetic katakana characters in "iroha" order (*`\`iroha**).
\pnirohad	46 phonetic double-byte katakana characters (*`\`iroha\dbchar*).
\pncnum	20 numbered list in circle (*`\`circlenum**).
\pnuldash	Dashed underline.
\pnuldashd	Dash-dotted underline.
\pnuldashdd	Dash-dot-dotted underline.
\pnulhair	Hairline underline.
\pnulth	Thick underline.
\pnulwave	Wave underline.
Drawing Objects
\dptxlrtb	Text box flows from left to right and top to bottom (default).
\dptxtbrl	Text box flows from right to left and top to bottom.
\dptxbtlr	Text box flows from left to right and bottom to top.
\dptxlrtbv	Text box flows from left to right and top to bottom, vertically.
\dptxtbrlv	Text box flows from top to bottom and right to left, vertically.
Frame Properties
\frmtxlrtb	Frame box flows from left to right and top to bottom (default).
\frmtxtbrl	Frame box flows right to left and top to bottom.
\frmtxbtlr	Frame box flows left to right and bottom to top.
\frmtxlrtbv	Frame box flows left to right and top to bottom, vertical.
\frmtxtbrlv	Frame box flows top to bottom and right to left, vertical.
Index Entries
*\\pxe**	"Yomi" (pronunciation) for index entry.
Paragraph Properties
\nocwrap	No character wrapping.
\nowwrap	No word wrapping.
\qd	Distributed.
\nooverflow	No overflow period and comma.
\aspalpha	Auto spacing between DBC and English.
\aspnum	Auto spacing between DBC and numbers.
\fahang	Font alignment → Hanging.
\facenter	Font alignment → Center.
\faroman	Font alignment → Roman (default).
\favar	Font alignment → Upholding variable.
\fafixed	Font alignment ? Upholding fixed.
Section Formatting Properties
\horzsect	Horizontal rendering.
\vertsect	Vertical rendering.
\pgndecd	Double-byte decimal numbering.
\pgndbnum	Kanji numbering without the digit character.
\pgndbnumd	Kanji numbering with the digit character.
Special Characters
\zwbo	Zero-width break opportunity. Used to insert break opportunity between two characters.
\zwnbo	Zero-width nonbreak opportunity. Used to remove break opportunity between two characters.
\qmspace	One-quarter em space.
Table Formatting
\cldglu	Diagonal line (top left to bottom right). Followed by <brdr>, which defines the properties of the diagonal border (`\cldglu` <brdr>).
\cldgll	Diagonal line (top right to bottom left). Followed by <brdr>, which defines the properties of the diagonal border (`\cldgll` <brdr>).
\cltxlrtb	Text in a cell flows from left to right and top to bottom (default).
\cltxtbrl	Text in a cell flows right to left and top to bottom.
\cltxbtlr	Text in a cell flows left to right and bottom to top.
\cltxlrtbv	Text in a cell flows left to right and top to bottom, vertical.
\cltxtbrlv	Text in a cell flows top to bottom and right to left, vertical.
\clvmgf	The first cell in a range of table cells to be vertically merged.
\clvmrg	Contents of the table cell are vertically merged with those of the preceding cell.
\clvertalt	Cell top align.
\clvertalc	Cell vertically center align.
\clvertalb	Cell bottom align.
Tabs
\tlmdot	Leader middle dots.

New Far East Control Words Created by Asian Versions of Word 97

Control Word	Meaning
Character Formatting Properties
`\cgrid`N	Character grid.
`\g`	Destination related to character grids.
`\gcw`	Grid column width.
`\gridtbl`	Destination keyword related to character grids.
`\nosectexpand`	Disable character space basement.
Paragraph Formatting Properties
`\adjustright`	Automatically adjust right indent when document grid is defined.
`\nosnaplinegrid`	Disable snap line to grid.
`\faauto`	Font alignment the default setting for this is "Auto."
Borders
`\brdrframe`	Border resembles a frame
Bullets and Numbers
`\pnaiueo`	46 phonetic Katakana characters in "aiueo" order (*aiueo).
`\pnaiueod`	46 phonetic double-byte Katakana characters (aiueodbchar).
`\pndbnumd`	Kanji numbering with the digit character (*dbnum2).
`\pndbnumt`	Kanji numbering 3 (*dbnum3).
`\pndbnuml`	Kanji numbering 3 (*dbnum3).
`\pndbnumk`	Kanji numbering 4 (*dbnum4).
`\pnganada`	Korean numbering 2 (*ganada).
`\pngbnum`	Chinese numbering 1 (*gb1).
`\pngbnumd`	Chinese numbering 2 (*gb2).
`\pngbnuml`	Chinese numbering 3 (*gb3).
`\pngbnumk`	Chinese numbering 4 (*gb4).
`\pnzodiac`	Chinese Zodiac numbering 1 (*zodiac1).
`\pnzodiacd`	Chinese Zodiac numbering 2 (*zodiac2).
`\pnzodiacl`	Chinese Zodiac numbering 3 (*zodiac3).
`\pnganada`	Korean numbering 1 (*ganada).
`\pnchosung`	Korean numbering 2 (*chosung).
Endnotes and Footnotes
`\ftnnchosung`	Footnote Korean numbering 1 (*chosung).
`\ftnncnum`	Footnote Circle numbering (*circlenum).
`\ftnndbnum`	Foonote Kanji numbering without the digit character (*dbnum1).
`\ftnndbnumd`	Footnote Kanji numbering with the digit character (*dbnum2).
`\ftnndbnumt`	Footnote Kanji numbering 3 (*dbnum3).
`\ftnndbnumk`	Footnote Kanji numbering 4 (*dbnum4).
`\ftnndbar`	Footnote double-byte numbering (*dbchar).
`\ftnnganada`	Footnote Korean numbering 2 (*ganada).
`\ftnngbnum`	Footnote Chinese numbering 1 (*gb1).
`\ftnngbnumd`	Footnote Chinese numbering 2 (*gb2).
`\ftnngbnuml`	Footnote Chinese numbering 3 (*gb3).
`\ftnngbnumk`	Footnote Chinese numbering 4 (*gb4).
`\ftnnzodiac`	Footnote numbering— Chinese Zodiac numbering 1 (* zodiac1)
`\ftnnzodiacd`	Footnote numbering— Chinese Zodiac numbering 2 (* zodiac2)
`\ftnnzodiacl`	Footnote numbering— Chinese Zodiac numbering 3 (* zodiac3).
`\aftnnchosung`	Endnote Korean numbering 1 (*chosung).
`\aftnncnum`	Endnote Circle numbering (*circlenum).
`\aftnndbnum`	Endnote Kanji numbering without the digit character (*dbnum1).
`\aftnndbnumd`	Endnote Kanji numbering with the digit character (*dbnum2).
`\aftnndbnumt`	Endnote Kanji numbering 3 (*dbnum3).
`\aftnndbnumk`	Endnote Kanji numbering 4 (*dbnum4).
`\aftnndbar`	Endnote double-byte numbering (*dbchar).
`\aftnnganada`	Endnote Korean numbering 2 (*ganada).
`\aftnngbnum`	Endnote Chinese numbering 1 (*gb1).
`\aftnngbnumd`	Endnote Chinese numbering 2 (*gb2).
`\aftnngbnuml`	Endnote Chinese numbering 3 (*gb3).
`\aftnngbnumk`	Endnote Chinese numbering 4 (*gb4).
`\aftnnzodiac`	Endnote numbering— Chinese Zodiac numbering 1 (* zodiac1)
`\aftnnzodiacd`	Endnote numbering— Chinese Zodiac numbering 2 (* zodiac2)
`\aftnnzodiacl`	Endnote numbering— Chinese Zodiac numbering 3 (* zodiac3).
Section Formatting Properties
`\pgnchosung`	Korean numbering 1 (* chosung).
`\pgncnum`	Circle numbering (*circlenum).
`\pgndbnumt`	Kanji numbering 3 (*dbnum3).
`\pgndbnumk`	Kanji numbering 4 (*dbnum4).
`\pgnganada`	Korean numbering 2 (*ganada)
`\pgngbnum`	Chinese numbering 1 (*gb1).
`\pgngbnumd`	Chinese numbering 2 (*gb2).
`\pgngbnuml`	Chinese numbering 3 (*gb3).
`\pgngbnumk`	Chinese numbering 4 (*gb4).
`\pgnzodiac`	Chinese Zodiac numbering 1 (*zodiac1).
`\pgnzodiacd`	Chinese Zodiac numbering 2 (*zodiac2).
`\pgnzodiacl`	Chinese Zodiac numbering 3 (*zodiac3).
`\sectexpand`N	Character space basement (character pitch minus font size) N in device independent units (a device independent unit is 1/294912^th of an inch).
`\sectlinegrid`N	Line grid, where N is the line pitch in 20ths of a point.
`\sectdefaultcl`	Default state of section. Indicates `\sectspecifycl` and `\sectspecifyl` are not emitted.
`\sectspecifycl`	Specify number of characters per line only.
`\sectspecifyl`	Specify both number of characters per line and number of lines per page.
Document Formatting Properties
`\dgmargin`	Grid to follow margins.
Index Entries
\yxe	Pronunciation (or heading) for index entry, used in phonetic sorting.

New Far East Control Words Created by Word 2000

Document Formatting Properties
`\jsksu`	Indicates that the strict Kinsoku set must be used for Japanese; `\jsku` should not be present if `\ksulang`N is present and the language N is Japanese.
`\ksulang`N	Indicates what language N the customized kinsoku characters defined in the `\fchars` and `\lchars` destinations belong to.
Section Formatting Properties
`\sectspecifygenN`	Indicates that text should snap to the character grid. Note that the N is part of the keyword.
Paragraph Formatting Properties
`\cufi`N	First-line indent in hundredths of a character unit; overrides `\fi`N although they should both be emitted with equivalent values.
`\culi`N	Left indent (space before) in character units. Behaves like \linN and overrides `\li`N and `\lin`N although they should all be emitted with equivalent values.
`\curi`N	Right indent (space after) in character units. Behaves like \rinN and overrides `\ri`N and `\rin`N although they should all be emitted with equivalent values.
`\lisb`N	Space before in hundredths of a character unit. Overrides `\sb`N although they should both be emitted with equivalent values.
\lisaN	Space after in hundredths of a character unit. Overrides `\sa`N although they should both be emitted with equivalent values.
Character Formatting Properties
\horzvertN	Text in the group flows in a direction opposite to that of the main document (Horizontal in vertical and vertical in horizontal): 0 — Switched text is uncompressed. 1 — Switched text is compressed to current line height.
\twoinoneN	Text in the group is displayed as two half-height lines within a line. 0 — Text is not enclosed. 1 — Text is enclosed in parentheses. 2 — Text is enclosed in square brackets ([]). 3 — Text is enclosed in angled brackets (<>). 4 — Text is enclosed in braces ({}).
`\fittext`N	Fit the text in the current group in N twips. When N is set to -1 (`\fittext`-1) it indicates a continuation of the previous `\fittext`N run. In other words `{\fittext1000 Fit this} {\fittext-1 text}` fits the string "Fit this text" in 1000 twips.

latex2rtf | RTF Index | Next Section