aboutsummaryrefslogtreecommitdiff
path: root/webfont.txt
diff options
context:
space:
mode:
authorSergey Poznyakoff <gray@gnu.org.ua>2012-02-02 14:42:06 +0200
committerSergey Poznyakoff <gray@gnu.org.ua>2012-02-02 14:42:06 +0200
commit3d4fbac289846464491104b01bebe554da6758da (patch)
treeef314e6d3f0c12d1879e43c4c0bb5753cc9e5f78 /webfont.txt
parentb61268b9deea32b7d965808f47d1227e3197a83c (diff)
downloadgcide-3d4fbac289846464491104b01bebe554da6758da.tar.gz
gcide-3d4fbac289846464491104b01bebe554da6758da.tar.bz2
Reorganize the directory structure.
* .gitignore: New file. * Makefile: Fix the list of distributed files. * README.DIC: Rename to README and edit. * WXXVII.JPG: Remove. * abbrevn.lst: New file. * authors.lst: New file. * gcide.conf: New file. * PRONUNC.JPG: Rename to pronunc.jpg. * PRONUNC.WEB: Rename to pronunc.txt. * SYMBOLS.JPG: Rename to symbols.jpg * TAGSET.WEB: Rename to tagset.txt * WEBFONT.ASC: Rename to webfont.txt. * titlepage.png: New file.
Diffstat (limited to 'webfont.txt')
-rw-r--r--webfont.txt604
1 files changed, 604 insertions, 0 deletions
diff --git a/webfont.txt b/webfont.txt
new file mode 100644
index 0000000..591e980
--- /dev/null
+++ b/webfont.txt
@@ -0,0 +1,604 @@
1 WEBSTER FONTS
2 =============
3
4 Fonts for the Webster 1913 Dictionary.
5 For version 0.50
6 Last edit May 5, 2001
7 ______________________________________
8 (This file contains some extended ASCII characters, and should be
9transmitted in binary mode)
10----------------------------------------------------------------------
11
12 This file describes a modified font for use in visualizing the
13text of the 1913 "Webster's Revised Unabridged Dictionary" (W1913),
14usable for the DOS operating system of IBM-compatible personal computers.
15The electronic version of that dictionary and this font were prepared by
16MICRA, Inc., Plainfield NJ, and are copyrighted (C) 1996 by MICRA, Inc.
17For details of permissions and restrictions on using these files, see
18the accompanying file "readme.web".
19 The special characters used in the electronic version of the Webster
201913 are required for visualizing unusual characters used in the
21etymology and pronunciation fields of the dictionary, in a form
22comparable to the way they appear in the original. Since there are
23more than 256 characters used in that dictionary, not all can be
24represented by single-byte codes, and are instead represented by
25SGML-style "short-form" symbols. (rather than the "entity" format
26"&xx;" The ampersand is used frequently, and we prefer to leave
27the "<" as the only "escape" character) of the type <x/ where x
28is a specific code for the symbol in the dictionary.
29See the "Short Form" section below for details about such characters.
30Note that the symbols used here are in some cases abbreviations
31(for compactness) of the ISO 8879 recommended symbols. If necessary,
32the table below allows simple replacement by alternate encodings.
33 This symbol font can be loaded in IBM-compatible (x86) computers
34running the DOS operating system by using the "font.bat" command file
35in the "utils" directory. The fonts files for 8x14 and 8x16 fonts are
36"web14.fnt" and "web16.fnt" respectively.
37 For those loading the Webster onto some machine other than an
38IBM-compatible running DOS, it will be necessary to provide a
39translation table, to convert these characters into a code that
40can be handled by that computer. For this reason, I attach an
41"explanation" for each character, for those who cannot view
42the original DOS font.
43 The DOS-loadable font does not contain all of the characters needed
44to depict the etymologies or the pronunciations. In addition to an
45absence of several characters used in the pronunciations, no Greek letters are
46included. The Greek words appearing in the etymologies,
47when they are included, will be typed in a
48roman-letter transcription (See section on Greek transcription, below).
49Only a very few Greek words have been thus transcribed as of the
50present version (version 0.41).
51 Wherever the typists did not know the character to use, they
52usually inserted a reverse-video question mark (decimal 176).
53This appears in full-ASCII versions as <?/. This mark was used both for
54characters in non-ASCII fonts, and for unreadable characters (i.e.,
55characters smeared in the original or distorted in the copies available
56to the typists. The type in the original was in many places smeared and
57illegible at the left and right page margins; occasionally, small
58parts of words were blotted out by plain white space).
59 A character table for the high-order characters appears below.
60Under that is a list and description of most of the special characters
61used in the Webster files.
62 Note that there are yet some characters used in the etymologies,
63and some other symbols, which are not in this list. For example, the
64vowels with a double dot *underneath*, e.g. a (as in all) have no representation
65in this character set, and, where explicitly entered in the dictionary,
66are represented by <xdd/ where "x" is the letter, as in "<add/".
67
68ITALICS
69-------
70 In most places, italic font is represented by the tags <it>...</it>
71surrounding the italic text, or by some other tag which also implies
72italic font. In the pronunciations, however, where italicized vowels
73are used among non-italic and other special characters to indicate
74pronunciation, the special codes <ait/, <eit/, <iit/, <oit/, <uit/,
75are also used to indicate the italicized vowel.
76
77DIACRITICS
78-------------
79 The European grave and acute accents are represented by the
80standard (IBM PC) high-order codes. Other characters with diacritics
81are represented by special "entity" codes, and in some cases also
82are found in this special WEB1913 font, described below.
83 Vowels with a circle above (as in Swedish) are coded <xring/
84(x with a ring, or "degrees" mark over it); vowels with tilde over them
85are represented by <xtil/, where "x" is the vowel, as in <etil/ (<atil/
86also has code 238); letters with a dot above are represented by <xdot/
87-- letter with a dot below are represented by <xsdot/ ("subdot");
88vowels with the semi-long mark (a macron with a short perpendicular
89vertical stroke attached above) are represented by <xsl/; the
90circumflex vowels have codes on this list, but may also be represented
91as <xcir/; vowels with macrons above are <xmac/ (including <oomac/,
92the "oo" with an unbroken macron above the two letters, <aemac/ = the
93ligature ae with a macron [also 214 = \'d6], and <oemac/ the ligature
94oe with a macron [also 215 = \'d7]); vowels with umlauts or a crescent
95(breve) above have codes in this list, but may also be represented by
96<xum/ and <xcr/ respectively. There is an occasional hacek or caron mark
97(an inverted circumflex) in the original; such letters are coded <xcar/.
98The o with a caron has code 213, but no others are in this font list.
99The diaeresis is treated typographically as identical to the umlaut.
100 A special modification, used only for poetry (see entry "saturnian verse"
101under "saturnian") is a vowel with a macron, in which the macron is lighter
102than the usual macron, signifying a stressed syllable which has a short
103vowel sound. This is represented by <xsmac/ ("short mac").
104 Another special character used in pronunciations is an "n" with an underline (like
105a macron, but below the letter), used to represent the "ng" sound. This is coded
106<nsm/ ("n sub-macron"). The ligated th used in pronunciations to depict the
107"th" sound of "the" is coded as <th/.
108 NOTE: the letter combinations "fi" and "fl" are invariably printed as the
109ligatures &filig; and &fllig;, but these ligatures are not marked as such
110in this transcription, and the two letters are left as individuals.
111
112SPECIAL SYMBOLS
113 The dagger <dag/, double dagger <ddag/, and paragraph mark <para/ are rarely used.
114 The double prime, or "seconds" of a degree is sometimes represented by
115a double "light accent" (code 183 = \'b7). In other places, and in later
116versions, it is represented by <sec/ = hex a9, in the webfont.
117 The symbols "greater than" <gt/ and "less than" are encountered only
118once, but are distinguished from the right- and left-angle brackets
119(> and <) because of possible typographical differences in some fonts.
120 The schwa is symbolized by <schwa/. It is not used in the
121pronunciations, but is mentioned as a symbol.
122 The right-pointing arrow is <rarr/, consistent with ISO 8879.
123
124----------------------------------
125Table 1
126----------------------------------
127Numbers
128 Hex codes
1291  
13011   (12 is a hard page break, 13 CR, 14 sect break)
13121  
13231  !"# $%&'(
133121 yz{|} ~ 79-7d 7e-82
134131 83-87 88-8c
135141 8d-91 92-96
136151 97-9b 9c-a0
137161 a1-a5 a6-aa
138171 ab-af b0-b4
139181 b5-b9 ba-be
140191 bf-c3 c4-c8
141201 c9-cd ce-d2
142211 d3-d7 d8-dc
143221 dd-e1 e2-e6
144231 e7-eb ec-f0
145241 f1-f5 f6-fa
146251 fb-ff
147
148=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
149Below is a complete list of the symbols used in the Webster ("webfont")
150which are encoded in the special font listed above, together with
151corresponding symbols in ISO 8879 and Tex coding. Much of this table was
152prepared by Rik Faith, to whom we express our appreciation.
153 The "nearest ASCII" equivalents are given for those who want to
154display the data as best one can in 7-bit simple ASCII symbols without
155using the "entity" symbols.
156=========================================================================
157----------------------------------
158Table 2
159----------------------------------
160
161Comments:
162 (1) The symbol in the "entity" column is the SGML-like symbol used in
163 the present Webster files; the symbol in the "ISO 8879" column is
164 the symbol for the same character given in "The user's guide to
165 ISO 8879" by Smith and Stutely.
166 (2) An asterisk "*" in the "entity" column means that this symbol and
167code value is not used in any form in the Webster 1913 electronic version.
168 (3) If no asterisk is in the "entity" column, and no other symbol is
169there, this means that in the Webster, only the hexadecimal representation
170was used (e.g. for \'d8, \'bd, and \'b8).
171 (4) \'b6 and \'b7, the heavy and light "accents", are never above a
172letter (these are not diacritical marks), but in-between letters, as the
173stress accent used in the headwords and pronunciations. The accent
174*follows* the syllable accented. The light accent \'b7 is also used as
175the "prime" in mathematical expressions (e.g. a\'b7 = "a prime"), or as
176 "minutes" in degrees-minutes-seconds, and when doubled (\'b7\'b7)
177serves as "double prime" in mathematical expressions, and as "seconds"
178in degrees-minutes-seconds. The character \'a9 (<sec/ or &Prime;) is
179also used to represent the double prime.
180 (5) Although the semilong vowels are in the table (e.g. the "asl"
181= "a semilong", most of the entries in the ASCII version dictionary
182use the <xsl/ symbol coding. If you know of any printers' names for
183these, do let me know.
184 (6) For some reason, the a breve and u breve have ISO codes (in the
185Latin-2 table), but the other vowels don't, in the Smith & Stutely book.
186Is this a mistake?
187 (7) The symbol <nsc/ is used for "N small capitals", used in
188pronunciations to represent the soun fo the nasal N in French words.
189 (8) A weak accent (when not in pronunciations) is symbolized by <prime/, the "minutes" (of a degree) symbol. A strong accent is symbolized by <bprime/ ("bold prime", not an ISO entity).
190 (9) If you find any exceptions to these usage assertions, please
191let me know.
192----------------------------------------------------------------------------------------
193 webfont ISO 8879 latin1/ascii TeX nearest description
194------------------ ASCII
195oct dec hex entity oct dec hex
196--------------------------------------------------------------------------------
197025 21 15 * \S * section symbol
198
199074 60 3c lt 074 60 3c $<$ < less than
200076 62 3e gt 076 62 3e $>$ > greater than
201
202200 128 80 <Cced/ Ccedil 307 199 c7 \c{C} C C cedilla
203201 129 81 <uum/ uuml 374 252 fc \"u ue u umlaut (diaeresis)
204202 130 82 <eacute/ eacute 351 233 e9 \'e e e acute
205203 131 83 <acir/ acirc 342 226 e2 \^a a a circumflex
206204 132 84 <aum/ auml 344 228 e4 \"a ae a umlaut (diaeresis)
207205 133 85 <agrave/ agrave 340 224 e0 \`a a a grave
208206 134 86 <aring/ aring 345 229 e5 \aa a a ring above
209207 135 87 <cced/ ccedil 347 231 e7 \c{c} c c cedilla
210210 136 88 <ecir/ ecirc 352 234 ea \^e e e circumflex
211211 137 89 <eum/ euml 353 235 eb \"e e e umlaut (diaeresis)
212212 138 8a <egrave/ egrave 350 232 e8 \`e e e grave
213213 139 8b <ium/ iuml 357 239 ef \"i i i umlaut (diaeresis)
214214 140 8c <icir/ icirc 356 238 ee \^i i i circumflex
215215 141 8d <igrave/ igrave 354 236 ec \`i i i grave
216216 142 8e <Aum/ Auml A A umlaut
217217 143 8f Aring A A ring above
218