latex2rtf | RTF Index | Next Section |
Word 2000 contains full support for all Far East features introduced in all previous Asian versions of Word, and so it has the ability to read and write RTF keywords related to such features. All of them should be described in the appropriate sub-section of the Contents of an RTF File section, but details specific to that handling of Far East characters are given here.
An escape expression (for example, \'hh, \\, or \{) is usable in all RTF control words.
In general RTF should be written out with all characters above 0x80 in the escaped form, \'hh.
Character code | Write out as |
0x00 <= ch < 0x20 | Escaped (\'hh) |
0x20 <= ch < 0x80 | Raw (non-escaped) character |
0x80 <= ch <= 0xFF | Escaped (\'hh) |
0x5C, 0x7B, 0x7D (special RTF characters \,{, or }) | Escaped (\'hh) |
When the RTF reader encounters raw characters in the leading-byte range of the double-byte character, it regards the next character as the trailing byte of the double-byte character and combines the two characters into one double-byte character.
Leading byte | Trailing byte | Validity |
Escaped | Raw (0x20 <= ch <= 0x7f) | Valid (standard format for double-byte character) |
Escaped | Escaped (other) | Valid (standard format for double-byte character) |
Raw | Raw | Valid (RTF-J format for double-byte character) |
Raw | Escaped | Invalid |
Note†††
Characters that are special RTF symbols (\,{, or }) should always be escaped, preferably using the \'hh syntax.
Word J specifies the character set in the font table using \fcharset
. Word J interprets \cpg437
as \fcharset0
and \cpg932
as \fcharset128
if it encounters these control words when reading RTF. If both \fcharset
and \cpg
appear in the font table, \cpg
is ignored.
Word maps single-byte characters according to character set information (for example, Macintosh to ANSI) and leaves double-byte characters unmapped.
RTF-J
control words |
Definition and the interpretation in Word |
\jis |
RTF-J uses \jis as a control word for character set. Word J interprets this as \ansi , which is the default character set used if the character set is not defined.
|
\fjminchou and \fjgothic |
RTF-J uses \fjminchou and \fjgothic to specify font family. Word J interprets these as \fnil , which is the default font family.
|
If \cpg
or \fcharset
control words are not present, Word J uses the text metrics of the font before determining the character set of these fonts. If the font is unknown, Word J assumes it is SHIFTJIS_CHARSET.
Word J defines control words to specify composite fonts as associated character properties. These control words follow the rule of associated character properties and understand font designation (\af
). All other <aprops> are ignored in Word J.
<atext> | <losbrun> | <hisbrun> | <dbrun> |
<losbrun> |
\hich \af & <aprops> \dbch \af & <aprops> \loch <ptext>
|
<hisbrun> |
\loch \af & <aprops> \dbch \af & <aprops> \hich <ptext>
|
<dbrun> |
\loch \af & <aprops> \hich \af & <aprops> \dbch <ptext>
|
Control Word | Definition |
\loch | Specifies a run of the characters in the low-ANSI (0x00-0x7F) area. |
\hich | For the characters in the high-ANSI (0x80-0xFF) area. |
\dbch | Specifies a run of the double-byte characters. |
Word J writes out associated character properties in the styles. In the style sheet, the <dbrun> definition should be used for compatibility with applications that have transparent readers.
{\stylesheet{\loch\af5\hich\af5\dbch\f27\fs20\snext0 Normal;}}
If the composite font definition matches the style, only the control word (\loch
, \hich
, or \dbch
) will be used to distinguish the type of run, together with the font information for transparent readers.
{\fonttbl{\f5\fswiss\fcharset0\fprq2 Arial;}{\f27\froman\fcharset128\fprq1 Mincho;}} {\stylesheet{\loch\af5\hich\af5\dbch\f27\fs20\snext0 Normal;}} \pard\plain {\dbch\f27\fs20 \'82\'b1\'82\'ea\'82\'cd} {\loch\f5 Test } {\dbch\f27\'82\'c5\'82\'b7\'81B} \par}
If one or all of \loch
, \hich
, and \dbch
are missing from the style sheet definition (or the character set doesn't match), Word J will apply appropriate fonts to each character run in the style using the bulleted rules below.
Control
Word |
Font that Word J will apply |
\loch | Same font as \f. |
\hich | Any font whose character set is ANSI_CHARSET. |
\dbch | Any font whose character set is SHIFTJIS_CHARSET. |
If the composite font control words are missing from the character run, Word J will interpret all characters below 0x80 as a \loch
run. Characters above or equal to 0x80 will be determined using the following rules:
\dbch
run (one double-byte character). For example: \'99\'47‡
\hich
run (two high-ANSI or low-ANSI characters). For example: \'99\'FF‡ÿ
\hich
run (one high-ANSI character). For example: \'99\parà
\hich
run (one high-ANSI character). For example: \'FFàÿ
Control
Word |
Description |
Associated Character Properties | |
\loch | The text consists of single-byte low-ANSI (0x00-0x7F) characters. |
\hich | The text consists of single-byte high-ANSI (0x80-0xFF) characters. |
\dbch | The text consists of double-byte characters. |
Borders | |
\brdrdash | Dashed border. |
\brdrdashd | Dash-dotted border. |
\brdrdashdd | Dash-dot-dotted border. |
Character Properties | |
\uldash | Dashed underline. |
\uldashd | Dash-dotted underline. |
\uldashdd | Dash-dot-dotted underline. |
\ulhair | Hairline underline. |
\ulth | Thick underline. |
\ulwave | Wave underline. |
\accnone | No accent characters (over dot / over comma). |
\accdot | Over dot accent. |
\acccomma | Over comma accent. |
\charscalex | Character width scaling. |
\striked1 |
Double strikethrough. \striked0 turns it off.
|
Document Formatting Properties | |
\horzdoc | Horizontal rendering. |
\vertdoc | Vertical rendering. |
\*\fchars | List of following kinsoku characters. |
\*\lchars | List of leading kinsoku characters. |
\jcompress | Compressing justification (default). |
\jexpand | Expanding justification. |
\gutterprl | Parallel gutter. |
\dgsnap | Snap to drawing grid. |
\dghspaceN | Drawing grid horizontal spacing in twips (the default is 120). |
\dgvspaceN | Drawing grid vertical spacing in twips (the default is 120). |
\dghoriginN | Drawing grid horizontal origin in twips (the default is 1701). |
\dgvoriginN | Drawing grid vertical origin in twips (the default is 1984). |
\dghshowN | Show Nth horizontal drawing gridline (the default is 3). |
\dgvshowN | Show Nth vertical drawing gridline (the default is 0). |
\twoonone | Print two logical pages on one physical page. |
\lnongrid | Define line based on the grid. |
Bullets and Numbering | |
\pndecd |
Double-byte decimal numbering (\ *arabic\*dbchar).
|
\pndbnum |
Kanji numbering without the digit character (\ *dbnum1).
|
\pnaiu |
46 phonetic katakana characters in "aiueo" order (\ *aiueo).
|
\pnaiud |
46 phonetic double-byte katakana characters (\ *aiueo\*dbchar).
|
\pniroha |
46 phonetic katakana characters in "iroha" order (\ *iroha).
|
\pnirohad |
46 phonetic double-byte katakana characters (\ *iroha\*dbchar).
|
\pncnum |
20 numbered list in circle (\ *circlenum).
|
\pnuldash | Dashed underline. |
\pnuldashd | Dash-dotted underline. |
\pnuldashdd | Dash-dot-dotted underline. |
\pnulhair | Hairline underline. |
\pnulth | Thick underline. |
\pnulwave | Wave underline. |
Drawing Objects | |
\dptxlrtb | Text box flows from left to right and top to bottom (default). |
\dptxtbrl | Text box flows from right to left and top to bottom. |
\dptxbtlr | Text box flows from left to right and bottom to top. |
\dptxlrtbv | Text box flows from left to right and top to bottom, vertically. |
\dptxtbrlv | Text box flows from top to bottom and right to left, vertically. |
Frame Properties | |
\frmtxlrtb | Frame box flows from left to right and top to bottom (default). |
\frmtxtbrl | Frame box flows right to left and top to bottom. |
\frmtxbtlr | Frame box flows left to right and bottom to top. |
\frmtxlrtbv | Frame box flows left to right and top to bottom, vertical. |
\frmtxtbrlv | Frame box flows top to bottom and right to left, vertical. |
Index Entries | |
\*\pxe | "Yomi" (pronunciation) for index entry. |
Paragraph Properties | |
\nocwrap | No character wrapping. |
\nowwrap | No word wrapping. |
\qd | Distributed. |
\nooverflow | No overflow period and comma. |
\aspalpha | Auto spacing between DBC and English. |
\aspnum | Auto spacing between DBC and numbers. |
\fahang | Font alignment → Hanging. |
\facenter | Font alignment → Center. |
\faroman | Font alignment → Roman (default). |
\favar | Font alignment → Upholding variable. |
\fafixed | Font alignment ? Upholding fixed. |
Section Formatting Properties | |
\horzsect | Horizontal rendering. |
\vertsect | Vertical rendering. |
\pgndecd | Double-byte decimal numbering. |
\pgndbnum | Kanji numbering without the digit character. |
\pgndbnumd | Kanji numbering with the digit character. |
Special Characters | |
\zwbo | Zero-width break opportunity. Used to insert break opportunity between two characters. |
\zwnbo | Zero-width nonbreak opportunity. Used to remove break opportunity between two characters. |
\qmspace | One-quarter em space. |
Table Formatting | |
\cldglu |
Diagonal line (top left to bottom right). Followed by <brdr>, which defines the properties of the diagonal border (\cldglu <brdr>).
|
\cldgll |
Diagonal line (top right to bottom left). Followed by <brdr>, which defines the properties of the diagonal border (\cldgll <brdr>).
|
\cltxlrtb | Text in a cell flows from left to right and top to bottom (default). |
\cltxtbrl | Text in a cell flows right to left and top to bottom. |
\cltxbtlr | Text in a cell flows left to right and bottom to top. |
\cltxlrtbv | Text in a cell flows left to right and top to bottom, vertical. |
\cltxtbrlv | Text in a cell flows top to bottom and right to left, vertical. |
\clvmgf | The first cell in a range of table cells to be vertically merged. |
\clvmrg | Contents of the table cell are vertically merged with those of the preceding cell. |
\clvertalt | Cell top align. |
\clvertalc | Cell vertically center align. |
\clvertalb | Cell bottom align. |
Tabs | |
\tlmdot | Leader middle dots. |
Control
Word |
Meaning |
Character Formatting Properties | |
\cgrid N
|
Character grid. |
\g
|
Destination related to character grids. |
\gcw
|
Grid column width. |
\gridtbl
|
Destination keyword related to character grids. |
\nosectexpand
|
Disable character space basement. |
Paragraph Formatting Properties | |
\adjustright
|
Automatically adjust right indent when document grid is defined. |
\nosnaplinegrid
|
Disable snap line to grid. |
\faauto
|
Font alignment the default setting for this is "Auto." |
Borders | |
\brdrframe
|
Border resembles a frame |
Bullets and Numbers | |
\pnaiueo
|
46 phonetic Katakana characters in "aiueo" order (*aiueo). |
\pnaiueod
|
46 phonetic double-byte Katakana characters (*aiueo*dbchar). |
\pndbnumd
|
Kanji numbering with the digit character (*dbnum2). |
\pndbnumt
|
Kanji numbering 3 (*dbnum3). |
\pndbnuml
|
Kanji numbering 3 (*dbnum3). |
\pndbnumk
|
Kanji numbering 4 (*dbnum4). |
\pnganada
|
Korean numbering 2 (*ganada). |
\pngbnum
|
Chinese numbering 1 (*gb1). |
\pngbnumd
|
Chinese numbering 2 (*gb2). |
\pngbnuml
|
Chinese numbering 3 (*gb3). |
\pngbnumk
|
Chinese numbering 4 (*gb4). |
\pnzodiac
|
Chinese Zodiac numbering 1 (*zodiac1). |
\pnzodiacd
|
Chinese Zodiac numbering 2 (*zodiac2). |
\pnzodiacl
|
Chinese Zodiac numbering 3 (*zodiac3). |
\pnganada
|
Korean numbering 1 (*ganada). |
\pnchosung
|
Korean numbering 2 (*chosung). |
Endnotes and Footnotes | |
\ftnnchosung
|
Footnote Korean numbering 1 (*chosung). |
\ftnncnum
|
Footnote Circle numbering (*circlenum). |
\ftnndbnum
|
Foonote Kanji numbering without the digit character (*dbnum1). |
\ftnndbnumd
|
Footnote Kanji numbering with the digit character (*dbnum2). |
\ftnndbnumt
|
Footnote Kanji numbering 3 (*dbnum3). |
\ftnndbnumk
|
Footnote Kanji numbering 4 (*dbnum4). |
\ftnndbar
|
Footnote double-byte numbering (*dbchar). |
\ftnnganada
|
Footnote Korean numbering 2 (*ganada). |
\ftnngbnum
|
Footnote Chinese numbering 1 (*gb1). |
\ftnngbnumd
|
Footnote Chinese numbering 2 (*gb2). |
\ftnngbnuml
|
Footnote Chinese numbering 3 (*gb3). |
\ftnngbnumk
|
Footnote Chinese numbering 4 (*gb4). |
\ftnnzodiac
|
Footnote numbering— Chinese Zodiac numbering 1 (* zodiac1) |
\ftnnzodiacd
|
Footnote numbering— Chinese Zodiac numbering 2 (* zodiac2) |
\ftnnzodiacl
|
Footnote numbering— Chinese Zodiac numbering 3 (* zodiac3). |
\aftnnchosung
|
Endnote Korean numbering 1 (*chosung). |
\aftnncnum
|
Endnote Circle numbering (*circlenum). |
\aftnndbnum
|
Endnote Kanji numbering without the digit character (*dbnum1). |
\aftnndbnumd
|
Endnote Kanji numbering with the digit character (*dbnum2). |
\aftnndbnumt
|
Endnote Kanji numbering 3 (*dbnum3). |
\aftnndbnumk
|
Endnote Kanji numbering 4 (*dbnum4). |
\aftnndbar
|
Endnote double-byte numbering (*dbchar). |
\aftnnganada
|
Endnote Korean numbering 2 (*ganada). |
\aftnngbnum
|
Endnote Chinese numbering 1 (*gb1). |
\aftnngbnumd
|
Endnote Chinese numbering 2 (*gb2). |
\aftnngbnuml
|
Endnote Chinese numbering 3 (*gb3). |
\aftnngbnumk
|
Endnote Chinese numbering 4 (*gb4). |
\aftnnzodiac
|
Endnote numbering— Chinese Zodiac numbering 1 (* zodiac1) |
\aftnnzodiacd
|
Endnote numbering— Chinese Zodiac numbering 2 (* zodiac2) |
\aftnnzodiacl
|
Endnote numbering— Chinese Zodiac numbering 3 (* zodiac3). |
Section Formatting Properties | |
\pgnchosung
|
Korean numbering 1 (* chosung). |
\pgncnum
|
Circle numbering (*circlenum). |
\pgndbnumt
|
Kanji numbering 3 (*dbnum3). |
\pgndbnumk
|
Kanji numbering 4 (*dbnum4). |
\pgnganada
|
Korean numbering 2 (*ganada) |
\pgngbnum
|
Chinese numbering 1 (*gb1). |
\pgngbnumd
|
Chinese numbering 2 (*gb2). |
\pgngbnuml
|
Chinese numbering 3 (*gb3). |
\pgngbnumk
|
Chinese numbering 4 (*gb4). |
\pgnzodiac
|
Chinese Zodiac numbering 1 (*zodiac1). |
\pgnzodiacd
|
Chinese Zodiac numbering 2 (*zodiac2). |
\pgnzodiacl
|
Chinese Zodiac numbering 3 (*zodiac3). |
\sectexpand N
|
Character space basement (character pitch minus font size) N in device independent units (a device independent unit is 1/294912th of an inch). |
\sectlinegrid N
|
Line grid, where N is the line pitch in 20ths of a point. |
\sectdefaultcl
|
Default state of section. Indicates \sectspecifycl and \sectspecifyl are not emitted.
|
\sectspecifycl
|
Specify number of characters per line only. |
\sectspecifyl
|
Specify both number of characters per line and number of lines per page. |
Document Formatting Properties | |
\dgmargin
|
Grid to follow margins. |
Index Entries | |
\yxe | Pronunciation (or heading) for index entry, used in phonetic sorting. |
Document Formatting Properties | |
\jsksu
|
Indicates that the strict Kinsoku set must be used for Japanese; \jsku should not be present if \ksulang N is present and the language N is Japanese.
|
\ksulang N
|
Indicates what language N the customized kinsoku characters defined in the \fchars and \lchars destinations belong to.
|
Section Formatting Properties | |
\sectspecifygenN
|
Indicates that text should snap to the character grid. Note that the N is part of the keyword. |
Paragraph Formatting Properties | |
\cufi N
|
First-line indent in hundredths of a character unit; overrides \fi N although they should both be emitted with equivalent values.
|
\culi N
|
Left indent (space before) in character units. Behaves like \linN and overrides \li N and \lin N although they should all be emitted with equivalent values.
|
\curi N
|
Right indent (space after) in character units. Behaves like \rinN and overrides \ri N and \rin N although they should all be emitted with equivalent values.
|
\lisb N
|
Space before in hundredths of a character unit. Overrides \sb N although they should both be emitted with equivalent values.
|
\lisaN |
Space after in hundredths of a character unit. Overrides \sa N although they should both be emitted with equivalent values.
|
Character Formatting Properties | |
\horzvertN |
Text in the group flows in a direction opposite to that of the main document (Horizontal in vertical and vertical in horizontal):
0 — Switched text is uncompressed. 1 — Switched text is compressed to current line height. |
\twoinoneN |
Text in the group is displayed as two half-height lines within a line.
0 — Text is not enclosed. 1 — Text is enclosed in parentheses. 2 — Text is enclosed in square brackets ([]). 3 — Text is enclosed in angled brackets (<>). 4 — Text is enclosed in braces ({}). |
\fittext N
|
Fit the text in the current group in N twips. When N is set to -1 (\fittext -1) it indicates a continuation of the previous \fittext N run. In other words {\fittext1000 Fit this} {\fittext-1 text} fits the string "Fit this text" in 1000 twips.
|
latex2rtf | RTF Index | Next Section |