diff options
Diffstat (limited to 'webfont.txt')
-rw-r--r-- | webfont.txt | 604 |
1 files changed, 604 insertions, 0 deletions
diff --git a/webfont.txt b/webfont.txt new file mode 100644 index 0000000..591e980 --- /dev/null +++ b/webfont.txt | |||
@@ -0,0 +1,604 @@ | |||
1 | WEBSTER FONTS | ||
2 | ============= | ||
3 | |||
4 | Fonts for the Webster 1913 Dictionary. | ||
5 | For version 0.50 | ||
6 | Last edit May 5, 2001 | ||
7 | ______________________________________ | ||
8 | (This file contains some extended ASCII characters, and should be | ||
9 | transmitted in binary mode) | ||
10 | ---------------------------------------------------------------------- | ||
11 | |||
12 | This file describes a modified font for use in visualizing the | ||
13 | text of the 1913 "Webster's Revised Unabridged Dictionary" (W1913), | ||
14 | usable for the DOS operating system of IBM-compatible personal computers. | ||
15 | The electronic version of that dictionary and this font were prepared by | ||
16 | MICRA, Inc., Plainfield NJ, and are copyrighted (C) 1996 by MICRA, Inc. | ||
17 | For details of permissions and restrictions on using these files, see | ||
18 | the accompanying file "readme.web". | ||
19 | The special characters used in the electronic version of the Webster | ||
20 | 1913 are required for visualizing unusual characters used in the | ||
21 | etymology and pronunciation fields of the dictionary, in a form | ||
22 | comparable to the way they appear in the original. Since there are | ||
23 | more than 256 characters used in that dictionary, not all can be | ||
24 | represented by single-byte codes, and are instead represented by | ||
25 | SGML-style "short-form" symbols. (rather than the "entity" format | ||
26 | "&xx;" The ampersand is used frequently, and we prefer to leave | ||
27 | the "<" as the only "escape" character) of the type <x/ where x | ||
28 | is a specific code for the symbol in the dictionary. | ||
29 | See the "Short Form" section below for details about such characters. | ||
30 | Note that the symbols used here are in some cases abbreviations | ||
31 | (for compactness) of the ISO 8879 recommended symbols. If necessary, | ||
32 | the table below allows simple replacement by alternate encodings. | ||
33 | This symbol font can be loaded in IBM-compatible (x86) computers | ||
34 | running the DOS operating system by using the "font.bat" command file | ||
35 | in the "utils" directory. The fonts files for 8x14 and 8x16 fonts are | ||
36 | "web14.fnt" and "web16.fnt" respectively. | ||
37 | For those loading the Webster onto some machine other than an | ||
38 | IBM-compatible running DOS, it will be necessary to provide a | ||
39 | translation table, to convert these characters into a code that | ||
40 | can be handled by that computer. For this reason, I attach an | ||
41 | "explanation" for each character, for those who cannot view | ||
42 | the original DOS font. | ||
43 | The DOS-loadable font does not contain all of the characters needed | ||
44 | to depict the etymologies or the pronunciations. In addition to an | ||
45 | absence of several characters used in the pronunciations, no Greek letters are | ||
46 | included. The Greek words appearing in the etymologies, | ||
47 | when they are included, will be typed in a | ||
48 | roman-letter transcription (See section on Greek transcription, below). | ||
49 | Only a very few Greek words have been thus transcribed as of the | ||
50 | present version (version 0.41). | ||
51 | Wherever the typists did not know the character to use, they | ||
52 | usually inserted a reverse-video question mark (decimal 176). | ||
53 | This appears in full-ASCII versions as <?/. This mark was used both for | ||
54 | characters in non-ASCII fonts, and for unreadable characters (i.e., | ||
55 | characters smeared in the original or distorted in the copies available | ||
56 | to the typists. The type in the original was in many places smeared and | ||
57 | illegible at the left and right page margins; occasionally, small | ||
58 | parts of words were blotted out by plain white space). | ||
59 | A character table for the high-order characters appears below. | ||
60 | Under that is a list and description of most of the special characters | ||
61 | used in the Webster files. | ||
62 | Note that there are yet some characters used in the etymologies, | ||
63 | and some other symbols, which are not in this list. For example, the | ||
64 | vowels with a double dot *underneath*, e.g. a (as in all) have no representation | ||
65 | in this character set, and, where explicitly entered in the dictionary, | ||
66 | are represented by <xdd/ where "x" is the letter, as in "<add/". | ||
67 | |||
68 | ITALICS | ||
69 | ------- | ||
70 | In most places, italic font is represented by the tags <it>...</it> | ||
71 | surrounding the italic text, or by some other tag which also implies | ||
72 | italic font. In the pronunciations, however, where italicized vowels | ||
73 | are used among non-italic and other special characters to indicate | ||
74 | pronunciation, the special codes <ait/, <eit/, <iit/, <oit/, <uit/, | ||
75 | are also used to indicate the italicized vowel. | ||
76 | |||
77 | DIACRITICS | ||
78 | ------------- | ||
79 | The European grave and acute accents are represented by the | ||
80 | standard (IBM PC) high-order codes. Other characters with diacritics | ||
81 | are represented by special "entity" codes, and in some cases also | ||
82 | are found in this special WEB1913 font, described below. | ||
83 | Vowels with a circle above (as in Swedish) are coded <xring/ | ||
84 | (x with a ring, or "degrees" mark over it); vowels with tilde over them | ||
85 | are represented by <xtil/, where "x" is the vowel, as in <etil/ (<atil/ | ||
86 | also has code 238); letters with a dot above are represented by <xdot/ | ||
87 | -- letter with a dot below are represented by <xsdot/ ("subdot"); | ||
88 | vowels with the semi-long mark (a macron with a short perpendicular | ||
89 | vertical stroke attached above) are represented by <xsl/; the | ||
90 | circumflex vowels have codes on this list, but may also be represented | ||
91 | as <xcir/; vowels with macrons above are <xmac/ (including <oomac/, | ||
92 | the "oo" with an unbroken macron above the two letters, <aemac/ = the | ||
93 | ligature ae with a macron [also 214 = \'d6], and <oemac/ the ligature | ||
94 | oe with a macron [also 215 = \'d7]); vowels with umlauts or a crescent | ||
95 | (breve) above have codes in this list, but may also be represented by | ||
96 | <xum/ and <xcr/ respectively. There is an occasional hacek or caron mark | ||
97 | (an inverted circumflex) in the original; such letters are coded <xcar/. | ||
98 | The o with a caron has code 213, but no others are in this font list. | ||
99 | The diaeresis is treated typographically as identical to the umlaut. | ||
100 | A special modification, used only for poetry (see entry "saturnian verse" | ||
101 | under "saturnian") is a vowel with a macron, in which the macron is lighter | ||
102 | than the usual macron, signifying a stressed syllable which has a short | ||
103 | vowel sound. This is represented by <xsmac/ ("short mac"). | ||
104 | Another special character used in pronunciations is an "n" with an underline (like | ||
105 | a macron, but below the letter), used to represent the "ng" sound. This is coded | ||
106 | <nsm/ ("n sub-macron"). The ligated th used in pronunciations to depict the | ||
107 | "th" sound of "the" is coded as <th/. | ||
108 | NOTE: the letter combinations "fi" and "fl" are invariably printed as the | ||
109 | ligatures fi and fl, but these ligatures are not marked as such | ||
110 | in this transcription, and the two letters are left as individuals. | ||
111 | |||
112 | SPECIAL SYMBOLS | ||
113 | The dagger <dag/, double dagger <ddag/, and paragraph mark <para/ are rarely used. | ||
114 | The double prime, or "seconds" of a degree is sometimes represented by | ||
115 | a double "light accent" (code 183 = \'b7). In other places, and in later | ||
116 | versions, it is represented by <sec/ = hex a9, in the webfont. | ||
117 | The symbols "greater than" <gt/ and "less than" are encountered only | ||
118 | once, but are distinguished from the right- and left-angle brackets | ||
119 | (> and <) because of possible typographical differences in some fonts. | ||
120 | The schwa is symbolized by <schwa/. It is not used in the | ||
121 | pronunciations, but is mentioned as a symbol. | ||
122 | The right-pointing arrow is <rarr/, consistent with ISO 8879. | ||
123 | |||
124 | ---------------------------------- | ||
125 | Table 1 | ||
126 | ---------------------------------- | ||
127 | Numbers | ||
128 | Hex codes | ||
129 | 1 | ||
130 | 11 (12 is a hard page break, 13 CR, 14 sect break) | ||
131 | 21 | ||
132 | 31 !"# $%&'( | ||
133 | 121 yz{|} ~ 79-7d 7e-82 | ||
134 | 131 83-87 88-8c | ||
135 | 141 8d-91 92-96 | ||
136 | 151 97-9b 9c-a0 | ||
137 | 161 a1-a5 a6-aa | ||
138 | 171 ab-af b0-b4 | ||
139 | 181 b5-b9 ba-be | ||
140 | 191 bf-c3 c4-c8 | ||
141 | 201 c9-cd ce-d2 | ||
142 | 211 d3-d7 d8-dc | ||
143 | 221 dd-e1 e2-e6 | ||
144 | 231 e7-eb ec-f0 | ||
145 | 241 f1-f5 f6-fa | ||
146 | 251 fb-ff | ||
147 | |||
148 | =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=- | ||
149 | Below is a complete list of the symbols used in the Webster ("webfont") | ||
150 | which are encoded in the special font listed above, together with | ||
151 | corresponding symbols in ISO 8879 and Tex coding. Much of this table was | ||
152 | prepared by Rik Faith, to whom we express our appreciation. | ||
153 | The "nearest ASCII" equivalents are given for those who want to | ||
154 | display the data as best one can in 7-bit simple ASCII symbols without | ||
155 | using the "entity" symbols. | ||
156 | ========================================================================= | ||
157 | ---------------------------------- | ||
158 | Table 2 | ||
159 | ---------------------------------- | ||
160 | |||
161 | Comments: | ||
162 | (1) The symbol in the "entity" column is the SGML-like symbol used in | ||
163 | the present Webster files; the symbol in the "ISO 8879" column is | ||
164 | the symbol for the same character given in "The user's guide to | ||
165 | ISO 8879" by Smith and Stutely. | ||
166 | (2) An asterisk "*" in the "entity" column means that this symbol and | ||
167 | code value is not used in any form in the Webster 1913 electronic version. | ||
168 | (3) If no asterisk is in the "entity" column, and no other symbol is | ||
169 | there, this means that in the Webster, only the hexadecimal representation | ||
170 | was used (e.g. for \'d8, \'bd, and \'b8). | ||
171 | (4) \'b6 and \'b7, the heavy and light "accents", are never above a | ||
172 | letter (these are not diacritical marks), but in-between letters, as the | ||
173 | stress accent used in the headwords and pronunciations. The accent | ||
174 | *follows* the syllable accented. The light accent \'b7 is also used as | ||
175 | the "prime" in mathematical expressions (e.g. a\'b7 = "a prime"), or as | ||
176 | "minutes" in degrees-minutes-seconds, and when doubled (\'b7\'b7) | ||
177 | serves as "double prime" in mathematical expressions, and as "seconds" | ||
178 | in degrees-minutes-seconds. The character \'a9 (<sec/ or ″) is | ||
179 | also used to represent the double prime. | ||
180 | (5) Although the semilong vowels are in the table (e.g. the "asl" | ||
181 | = "a semilong", most of the entries in the ASCII version dictionary | ||
182 | use the <xsl/ symbol coding. If you know of any printers' names for | ||
183 | these, do let me know. | ||
184 | (6) For some reason, the a breve and u breve have ISO codes (in the | ||
185 | Latin-2 table), but the other vowels don't, in the Smith & Stutely book. | ||
186 | Is this a mistake? | ||
187 | (7) The symbol <nsc/ is used for "N small capitals", used in | ||
188 | pronunciations to represent the soun fo the nasal N in French words. | ||
189 | (8) A weak accent (when not in pronunciations) is symbolized by <prime/, the "minutes" (of a degree) symbol. A strong accent is symbolized by <bprime/ ("bold prime", not an ISO entity). | ||
190 | (9) If you find any exceptions to these usage assertions, please | ||
191 | let me know. | ||
192 | ---------------------------------------------------------------------------------------- | ||
193 | webfont ISO 8879 latin1/ascii TeX nearest description | ||
194 | ------------------ ASCII | ||
195 | oct dec hex entity oct dec hex | ||
196 | -------------------------------------------------------------------------------- | ||
197 | 025 21 15 * \S * section symbol | ||
198 | |||
199 | 074 60 3c lt 074 60 3c $<$ < less than | ||
200 | 076 62 3e gt 076 62 3e $>$ > greater than | ||
201 | |||
202 | 200 128 80 <Cced/ Ccedil 307 199 c7 \c{C} C C cedilla | ||
203 | 201 129 81 <uum/ uuml 374 252 fc \"u ue u umlaut (diaeresis) | ||
204 | 202 130 82 <eacute/ eacute 351 233 e9 \'e e e acute | ||
205 | 203 131 83 <acir/ acirc 342 226 e2 \^a a a circumflex | ||
206 | 204 132 84 <aum/ auml 344 228 e4 \"a ae a umlaut (diaeresis) | ||
207 | 205 133 85 <agrave/ agrave 340 224 e0 \`a a a grave | ||
208 | 206 134 86 <aring/ aring 345 229 e5 \aa a a ring above | ||
209 | 207 135 87 <cced/ ccedil 347 231 e7 \c{c} c c cedilla | ||
210 | 210 136 88 <ecir/ ecirc 352 234 ea \^e e e circumflex | ||
211 | 211 137 89 <eum/ euml 353 235 eb \"e e e umlaut (diaeresis) | ||
212 | 212 138 8a <egrave/ egrave 350 232 e8 \`e e e grave | ||
213 | 213 139 8b <ium/ iuml 357 239 ef \"i i i umlaut (diaeresis) | ||
214 | 214 140 8c <icir/ icirc 356 238 ee \^i i i circumflex | ||
215 | 215 141 8d <igrave/ igrave 354 236 ec \`i i i grave | ||
216 | 216 142 8e <Aum/ Auml A A umlaut | ||
217 | 217 143 8f Aring A A ring above | ||
218 | |||
219 | 220 144 90 <Eacute/ Eacute 311 201 c9 \'E e E acute | ||
220 | 221 145 91 <ae/ aelig 346 230 e6 \ae ae ligature ae | ||
221 | 222 146 92 <AE/ AElig 306 198 c6 \AE AE ligature AE | ||
222 | 223 147 93 <ocir/ ocirc 364 244 f4 \^o o o circumflex | ||
223 | 224 148 94 <oum/ ouml 366 246 f6 \"o oe o umlaut (diaeresis) | ||
224 | 225 149 95 <ograve/ ograve 362 242 f2 \`o o o grave | ||
225 | 226 150 96 <ucir/ ucirc 373 251 fb \^u u u circumflex | ||