diff options
author | Sergey Poznyakoff <gray@gnu.org.ua> | 2012-01-30 01:33:31 +0200 |
---|---|---|
committer | Sergey Poznyakoff <gray@gnu.org.ua> | 2012-01-30 01:37:44 +0200 |
commit | 45bf9d9c1860e7950f073da828cfd5aed34d3491 (patch) | |
tree | 1f664e8d13bd41bf97b731925550890246026ee6 | |
parent | 96d405c0c71882883e63a2fb19baa8d4017a698f (diff) | |
download | gcide-45bf9d9c1860e7950f073da828cfd5aed34d3491.tar.gz gcide-45bf9d9c1860e7950f073da828cfd5aed34d3491.tar.bz2 |
Update ancillary files.
* README.DIC: Update versiona number and file names.
* WEBFONT.ASC: Document <oacute/, <ldquo/ and <rdquo/.
-rw-r--r-- | README.DIC | 34 | ||||
-rw-r--r-- | WEBFONT.ASC | 6 |
2 files changed, 20 insertions, 20 deletions
@@ -1,12 +1,12 @@ | |||
1 | File README.DIC | 1 | File README.DIC |
2 | To accompany the GNU version of the set of files (cide.*) containing | 2 | To accompany the GNU version of the set of files (cide.*) containing |
3 | the electronic version of the | 3 | the electronic version of the |
4 | Collaborative International Dictionary of English. | 4 | Collaborative International Dictionary of English. |
5 | (called also GCIDE) | 5 | (called also GCIDE) |
6 | These files contain Version 0.46 (January 2002) | 6 | These files contain Version 0.51 (January 2012) |
7 | * * * * * * * * * * * * * * * * * * * * * * * * * * * * | 7 | * * * * * * * * * * * * * * * * * * * * * * * * * * * * |
8 | 8 | ||
9 | The dictionary was derived from the | 9 | The dictionary was derived from the |
10 | Webster's Revised Unabridged Dictionary | 10 | Webster's Revised Unabridged Dictionary |
11 | Version published 1913 | 11 | Version published 1913 |
12 | by the C. & G. Merriam Co. | 12 | by the C. & G. Merriam Co. |
@@ -64,15 +64,15 @@ to the Free Software Foundation, Inc., 59 Temple Place - Suite 330, | |||
64 | Boston, MA 02111-1307, USA. | 64 | Boston, MA 02111-1307, USA. |
65 | * * * * * * * * * * * * * * * * * * * * * | 65 | * * * * * * * * * * * * * * * * * * * * * |
66 | 66 | ||
67 | STRUCTURE OF THE DICTIONARY | 67 | STRUCTURE OF THE DICTIONARY |
68 | --------------------------- | 68 | --------------------------- |
69 | When the archives are unpacked, the main dictionary text of | 69 | When the archives are unpacked, the main dictionary text of |
70 | the GCIDE will be found in 26 files named "cide.*", where the | 70 | the GCIDE will be found in 26 files named "CIDE.*", where the |
71 | asterisk indicates which letter of the alphabet begins the | 71 | asterisk indicates which letter of the alphabet begins the |
72 | words in each file. For example, file "cide.b" contains words | 72 | words in each file. For example, file "CIDE.B" contains words |
73 | beginning with the letter "B". Additional information about the | 73 | beginning with the letter "B". Additional information about the |
74 | tagging conventions and special character symbols are contained in | 74 | tagging conventions and special character symbols are contained in |
75 | ancillary files in this directory more information below). The main | 75 | ancillary files in this directory more information below). The main |
76 | body of the 1913 dictionary was essentially identical to the edition | 76 | body of the 1913 dictionary was essentially identical to the edition |
77 | published in 1890, and was republished in 1913 with an appendix | 77 | published in 1890, and was republished in 1913 with an appendix |
78 | containing "New Words". The new words of that appendix have been | 78 | containing "New Words". The new words of that appendix have been |
@@ -100,13 +100,13 @@ definition or other text in that paragraph came, as follows: | |||
100 | reformatted or slightly rearranged. If substantive information | 100 | reformatted or slightly rearranged. If substantive information |
101 | is added from a second source, usually the additional source is | 101 | is added from a second source, usually the additional source is |
102 | also noted, as in: | 102 | also noted, as in: |
103 | [<source>Webster 1913 Suppl.</source> + <source>WordNet 1.5</source>] | 103 | [<source>Webster 1913 Suppl.</source> + <source>WordNet 1.5</source>] |
104 | 104 | ||
105 | A list of the ancillary files related to the GCIDE is appended at | 105 | A list of the ancillary files related to the GCIDE is appended at |
106 | the bottom of this "readme.dic" file. | 106 | the bottom of this "README.DIC" file. |
107 | This version is tagged with SGML-like tags of the form <pos>...</pos> | 107 | This version is tagged with SGML-like tags of the form <pos>...</pos> |
108 | so that the original typography (italics, bold, block quotes) can be | 108 | so that the original typography (italics, bold, block quotes) can be |
109 | reproduced. A list of the most important tags for fields in the | 109 | reproduced. A list of the most important tags for fields in the |
110 | dictionary is given below. The tags also serve the more important | 110 | dictionary is given below. The tags also serve the more important |
111 | function of allowing the information content to be conveniently imported | 111 | function of allowing the information content to be conveniently imported |
112 | into computer programs or databases. The set of tags used is described | 112 | into computer programs or databases. The set of tags used is described |
@@ -116,17 +116,17 @@ as <note> and <cs> ("collocation section"), which in some cases span | |||
116 | multiple paragraphs. If you are using a tag parser which detects | 116 | multiple paragraphs. If you are using a tag parser which detects |
117 | improper nesting, you should first either delete the paragraph | 117 | improper nesting, you should first either delete the paragraph |
118 | tags or convert them to non-tag symbols, or, if possible, set the | 118 | tags or convert them to non-tag symbols, or, if possible, set the |
119 | parser to ignore the <p>...</p> tags. | 119 | parser to ignore the <p>...</p> tags. |
120 | The unusual characters (such as Greek or the European accented | 120 | The unusual characters (such as Greek or the European accented |
121 | characters, as well as special characters used in the pronunciations) | 121 | characters, as well as special characters used in the pronunciations) |
122 | are described in the accompanying file "webfont.asc". Some information | 122 | are described in the accompanying file "WEBFONT.ASC". Some information |
123 | on the pronunciation system used may be found by viewing the files | 123 | on the pronunciation system used may be found by viewing the files |
124 | "wxxvii.jpg" and "pronunc.jpg" with a GIF viewer (or any web browser), | 124 | "WXXVII.JPG" and "PRONUNC.JPG" with a GIF viewer (or any web browser), |
125 | and additional explanations of pronunciation are in the file | 125 | and additional explanations of pronunciation are in the file |
126 | "pronunc.web". | 126 | "PRONUNC.WEB". |
127 | Each paragraph of the original text is enclosed within tags of | 127 | Each paragraph of the original text is enclosed within tags of |
128 | the form <p> . . . </p>. Within these paragraphs are no line | 128 | the form <p> . . . </p>. Within these paragraphs are no line |
129 | breaks, and some of the paragraphs are over 12,000 characters long. | 129 | breaks, and some of the paragraphs are over 12,000 characters long. |
130 | These lines are too long to be handled by the vi editor, and probably | 130 | These lines are too long to be handled by the vi editor, and probably |
131 | by some other text editors. At some points, embedded line breaks within | 131 | by some other text editors. At some points, embedded line breaks within |
132 | a "paragraph" are marked by a <br/ "entity". The file can therefore | 132 | a "paragraph" are marked by a <br/ "entity". The file can therefore |
@@ -151,13 +151,13 @@ comment of the form <-- p. 345 -->. (The exact beginning was in some | |||
151 | cases in the middle of a paragraph, which we decided was not a | 151 | cases in the middle of a paragraph, which we decided was not a |
152 | good location for these page-number comments, so the page number | 152 | good location for these page-number comments, so the page number |
153 | was usually moved to the next paragraph break). Pages which have | 153 | was usually moved to the next paragraph break). Pages which have |
154 | been proofread by volunteers (e.g., with initials VOL) will have a | 154 | been proofread by volunteers (e.g., with initials VOL) will have a |
155 | note within that page comment: <-- p. 345 pr=VOL -->. Pages which have | 155 | note within that page comment: <-- p. 345 pr=VOL -->. Pages which have |
156 | not been proofread yet (most of them) will have varying numbers of | 156 | not been proofread yet (most of them) will have varying numbers of |
157 | typographical errors in them. We still (January 2002) need | 157 | typographical errors in them. We still (January 2012) need |
158 | proofreaders to get the errors out of these dictionary files. | 158 | proofreaders to get the errors out of these dictionary files. |
159 | 159 | ||
160 | *********************************************************************** | 160 | *********************************************************************** |
161 | ** WARNING!!! ** | 161 | ** WARNING!!! ** |
162 | *********************************************************************** | 162 | *********************************************************************** |
163 | 163 | ||
@@ -188,21 +188,21 @@ PC at: pc@worldsoul.org. | |||
188 | 188 | ||
189 | In addition to the main text of the dictionary, additional | 189 | In addition to the main text of the dictionary, additional |
190 | explanatory material about this version of the dictionary is available | 190 | explanatory material about this version of the dictionary is available |
191 | in the ancillary files: | 191 | in the ancillary files: |
192 | 192 | ||
193 | ===================================================================== | 193 | ===================================================================== |
194 | COPYING 18,321 11-03-99 1:13a COPYING | 194 | -rw-r--r-- 1 18021 2012-01-30 00:24 COPYING |
195 | README DIC 13,775 01-17-02 11:48p readme.dic | 195 | -rw-r--r-- 1 2569796 2000-06-18 15:11 PRONUNC.JPG |
196 | WEBFONT ASC 35,234 12-12-01 3:27p WEBFONT.ASC | 196 | -rw-r--r-- 1 13994 2012-01-30 00:24 PRONUNC.WEB |
197 | TAGSET WEB 55,843 08-16-01 1:16p TAGSET.WEB | 197 | -rw-r--r-- 1 13507 2012-01-30 00:27 README.DIC |
198 | PRONUNC WEB 14,312 06-18-00 3:02p PRONUNC.WEB | 198 | -rw-r--r-- 1 144716 2000-06-18 15:13 SYMBOLS.JPG |
199 | PRONUNC JPG 2,569,796 06-18-00 3:11p PRONUNC.JPG | 199 | -rw-r--r-- 1 54783 2012-01-30 00:24 TAGSET.WEB |
200 | SYMBOLS JPG 144,716 06-18-00 3:13p SYMBOLS.JPG | 200 | -rw-r--r-- 1 34631 2012-01-30 00:24 WEBFONT.ASC |
201 | WXXVII JPG 1,188,380 06-18-00 3:19p WXXVII.JPG | 201 | -rw-r--r-- 1 1188380 2000-06-18 15:19 WXXVII.JPG |
202 | ================================================================== | 202 | ===================================================================== |
203 | 203 | ||
204 | 204 | ||
205 | Most important tags used in the GCIDE: | 205 | Most important tags used in the GCIDE: |
206 | <hw> tags the headword | 206 | <hw> tags the headword |
207 | <pr> pronunciation | 207 | <pr> pronunciation |
208 | <pos> part of speech | 208 | <pos> part of speech |
diff --git a/WEBFONT.ASC b/WEBFONT.ASC index 198c0e0..c80dfd3 100644 --- a/WEBFONT.ASC +++ b/WEBFONT.ASC | |||
@@ -230,13 +230,13 @@ oct dec hex entity oct dec hex | |||
230 | 234 156 9c <pound/ pound 243 163 a3 \pounds * pound sign (British) | 230 | 234 156 9c <pound/ pound 243 163 a3 \pounds * pound sign (British) |
231 | 235 157 9d * | 231 | 235 157 9d * |
232 | 236 158 9e * | 232 | 236 158 9e * |
233 | 237 159 9f * | 233 | 237 159 9f * |
234 | 240 160 a0 <aacute/ aacute 341 225 e1 \'a a a acute | 234 | 240 160 a0 <aacute/ aacute 341 225 e1 \'a a a acute |
235 | 241 161 a1 <iacute/ iacute 355 237 ed \'i i i acute | 235 | 241 161 a1 <iacute/ iacute 355 237 ed \'i i i acute |
236 | 242 162 a2 oacute 363 243 f3 \'o o o acute | 236 | 242 162 a2 <oacute/ oacute 363 243 f3 \'o o o acute |
237 | 243 163 a3 uacute 372 250 fa \'u u u acute | 237 | 243 163 a3 uacute 372 250 fa \'u u u acute |
238 | 244 164 a4 <ntil/ ntilde 361 241 f1 \~n ny n tilde | 238 | 244 164 a4 <ntil/ ntilde 361 241 f1 \~n ny n tilde |
239 | 245 165 a5 <Ntil/ Ntilde NY N tilde | 239 | 245 165 a5 <Ntil/ Ntilde NY N tilde |
240 | 246 166 a6 <frac23/ $\frac{2}{3}$ 2/3 two-thirds | 240 | 246 166 a6 <frac23/ $\frac{2}{3}$ 2/3 two-thirds |
241 | 247 167 a7 <frac13/ $\frac{1}{3}$ 1/3 one-third | 241 | 247 167 a7 <frac13/ $\frac{1}{3}$ 1/3 one-third |
242 | 250 168 a8 * | 242 | 250 168 a8 * |
@@ -258,18 +258,18 @@ oct dec hex entity oct dec hex | |||
258 | (printer's "fist") | 258 | (printer's "fist") |
259 | 266 182 b6 \"{} '' bold accent | 259 | 266 182 b6 \"{} '' bold accent |
260 | (used in pronunciations) | 260 | (used in pronunciations) |
261 | 267 183 b7 prime 264 180 b4 \'{} ' light accent | 261 | 267 183 b7 prime 264 180 b4 \'{} ' light accent |
262 | (used in pronunciations) | 262 | (used in pronunciations) |
263 | also minutes (of arc or time) | 263 | also minutes (of arc or time) |
264 | 270 184 b8 '' " close double quote | 264 | 270 184 b8 <rdquo/ '' " close double quote |
265 | 271 185 b9 * | 265 | 271 185 b9 * |
266 | 272 186 ba * $\parallel$ || verticle double bar (l) | 266 | 272 186 ba * $\parallel$ || verticle double bar (l) |
267 | 273 187 bb * | 267 | 273 187 bb * |
268 | 274 188 bc <sect/ sect \S * section mark | 268 | 274 188 bc <sect/ sect \S * section mark |
269 | 275 189 bd `` " open double quotes | 269 | 275 189 bd <ldquo/ `` " open double quotes |
270 | 276 190 be <amac/ amacr \=a a a macron | 270 | 276 190 be <amac/ amacr \=a a a macron |
271 | 277 191 bf lsquo ` ` left single quote | 271 | 277 191 bf lsquo ` ` left single quote |
272 | 272 | ||
273 | 300 192 c0 <nsm/ ng "n sub-macron" | 273 | 300 192 c0 <nsm/ ng "n sub-macron" |
274 | 301 193 c1 <sharp/ sharp $\sharp$ # musical sharp | 274 | 301 193 c1 <sharp/ sharp $\sharp$ # musical sharp |
275 | 302 194 c2 <flat/ flat $\flat$ * musical flat | 275 | 302 194 c2 <flat/ flat $\flat$ * musical flat |