Home | Up | Questions? |
A transcript of Webster's Seventh New Collegiate Dictionary [1] is available in a computer-readable form under license from Merriam-Webster, the publisher. This is not just a word list, but a copy of the entire dictionary including definitions, cross references, variants, synonyms and so on. It consists of some 15,696,929 characters, with 68,764 main entries.
The original dictionary was keyboarded onto the Q-32 computer at System Development Corporation (SDC) for a project headed by John Olney [2]. The dictionary was then heavily edited and moved onto an IBM 360. Magnetic tapes of this form were moved to IBM T. J. Watson Research Center and further processed by C. Alberga. A copy of this was acquired by Robert Amsler, who used the Pocket Dictionary for his dissertation [3]. We have acquired a copy of the dictionary from Amsler, and have modified it in many minor ways. This document describes our version.
The dictionary is such a large collection of text that it is broken up into 220 files for easier handling. These files reside under the names d.101, d.102, d.103, ..., d.320.
Figure 1 is a listing of d.index, the index file, showing the first and last word of each dictionary file. Note that the first letter of each word in a file is the same; that is when we switch from words starting with 'D' to words starting with 'E', we start a new file. Otherwise, files are broken to create roughly equal-sized files (from 70,000 to 80,000 characters).
d.101;a | d.156;dyeweed | d.211;making | d.266;scutum |
d.102;acclaim | d.157;e | d.212;manyfold | d.267;sego |
d.103;adduce | d.158;egotist | d.213;match | d.268;semester |
d.104;afflict | d.159;embrace | d.214;megalith | d.269;sericin |
d.105;album | d.160;enfilade | d.215;met | d.270;shall |
d.106;alto | d.161;epigram | d.216;milk | d.271;shirk |
d.107;anagram | d.162;espy | d.217;mistily | d.272;side |
d.108;ant | d.163;exact | d.218;monocot | d.273;single |
d.109;apothem | d.164;expiate | d.219;motion | d.274;sky |
d.110;arginine | d.165;f | d.220;mushroom | d.275;slumber |
d.111;aspen | d.166;farmer | d.221;n | d.276;snowsuit |
d.112;attic | d.167;female | d.222;negus | d.277;somatic |
d.113;awful | d.168;filing | d.223;nimiety | d.278;spahi |
d.114;b | d.169;flange | d.224;notarial | d.279;spile |
d.115;bank | d.170;fluid | d.225;o | d.280;spread |
d.116;bathos | d.171;force | d.226;offend | d.281;stalk |
d.117;belay | d.172;fourth | d.227;open | d.282;steep |
d.118;bicolor | d.173;front | d.228;orpine | d.283;stockade |
d.119;bitumen | d.174;g | d.229;overbear | d.284;strength |
d.120;blow | d.175;gasbag | d.230;p | d.285;stylus |
d.121;boor | d.176;geometer | d.231;pant | d.286;suction |
d.122;brake | d.177;glamour | d.232;part | d.287;supernal |
d.123;broker | d.178;golden | d.233;pavid | d.288;swastika |
d.124;bunny | d.179;graphic | d.234;penny | d.289;syncope |
d.125;c | d.180;grow | d.235;perorate | d.290;t |
d.126;cambial | d.181;h | d.236;phial | d.291;tangible |
d.127;captive | d.182;handsel | d.237;pick | d.292;tedder |
d.128;cast | d.183;head | d.238;piping | d.293;terminal |
d.129;cemetery | d.184;hematoma | d.239;platypus | d.294;theory |
d.130;chaperon | d.185;hiemal | d.240;poetry | d.295;thresher |
d.131;chili | d.186;holmium | d.241;ponder | d.296;timid |
d.132;chukker | d.187;host | d.242;potent | d.297;tongue |
d.133;claymore | d.188;hydro | d.243;prefer | d.298;townee |
d.134;cob | d.189;i | d.244;pricket | d.299;transom |
d.135;colonize | d.190;imbecile | d.245;progeny | d.300;triduum |
d.136;company | d.191;improve | d.246;prot{e'}g{e'} | d.301;truck |
d.137;conduit | d.192;indaba | d.247;puff | d.302;turn |
d.138;conspire | d.193;infect | d.248;putty | d.303;u |
d.139;cookout | d.194;insect | d.249;q | d.304;underrun |
d.140;cosmetic | d.195;interior | d.250;quit | d.305;unquote |
d.141;crack | d.196;invoke | d.251;r | d.306;urologic |
d.142;crone | d.197;item | d.252;rank | d.307;v |
d.143;cumshaw | d.198;j | d.253;rebus | d.308;vegetal |
d.144;cystic | d.199;jive | d.254;reenact | d.309;vestment |
d.145;d | d.200;k | d.255;reline | d.310;visible |
d.146;debit | d.201;kiosk | d.256;repulse | d.311;w |
d.147;degauss | d.202;l | d.257;retrieve | d.312;wash |
d.148;depend | d.203;lapse | d.258;rickets | d.313;wedge |
d.149;detour | d.204;lean | d.259;rockweed | d.314;whiffler |
d.150;did | d.205;leveret | d.260;roust | d.315;wildling |
d.151;dirt | d.206;limit | d.261;ruthenic | d.316;wittily |
d.152;disperse | d.207;living | d.262;s | d.317;wreathe |
d.153;do | d.208;lose | d.263;samisen | d.318;x |
d.154;doublet | d.209;lyceum | d.264;sawlog | d.319;y |
d.155;drollery | d.210;m | d.265;scincoid | d.320;z |
Figure 2 shows a sample of one of the dictionary files.
F;dull;;;;aj;; |
P;'d{e}l |
E;ME [italic dul]; akin to OE [italic dol] foolish and prob. to L [italic# |
fumus] smoke -- more at [mini FUME] |
D;1;;;aj;mentally slow : [mini STUPID] |
D;2;a;;aj;slow in perception or sensibility : [mini INSENSIBLE] |
D;2;b;;aj;lacking zest or vivacity : [mini LISTLESS] |
D;3;a;;aj;slow in action : [mini SLUGGISH] |
D;3;b;;aj;marked by little business activity <~ season> |
D;4;;;aj;lacking sharpness of edge or point |
D;5;;;aj;lacking brilliance or luster |
D;6;a;;aj;not clear : [mini INDISTINCT] |
D;6;b;;aj;not resonant or ringing |
L;7;;;[italic of a color] |
D;7;;;aj;low in saturation and low in lightness |
D;8;;;aj;[mini CLOUDY], [mini OVERCAST] |
D;9;;;aj;[mini TEDIOUS], [mini UNINTERESTING] |
R;dull;;vb;; |
R;dullness;3;n;; |
V;dulness;3;01 |
P;'d{e}l-n{e}s |
R;dully;3;av;; |
P;'d{e}l-(l){e-} |
S;0;[mini BLUNT], [mini OBTUSE]: |
S;1;[mini DULL] applies physically to an edge or point that has lost its# |
original sharpness through use; figuratively it suggests loss of original or# |
usual quickness, zest, or pungency; |
S;2;[mini BLUNT] applies to an edge or end not designed to be sharp or keen;# |
it suggests innate or inherent lack of sharpness or quickness of feeling or# |
perception; |
S;3;[mini OBTUSE] applies to the shape of something whose point is broader# |
than a right angle; it suggests that which is inordinately blunt or# |
insensitive in perception or imagination <[italic obtuse] audience> |
X;stupid;;;8; |
Each line of the file has a character in column 1 which identifies the type and format of the line. The following table shows the number and meaning of each line type.
Frequency | Line type | Meaning |
68,764 | F | First line, start for a new main entry |
30,673 | E | Etymology |
66,987 | P | Pronunciation |
9,959 | V | Variant |
140,501 | D | Definition, one per line |
19,123 | R | Related word |
4,596 | X | Cross-Reference |
11,992 | L | Label |
835 | S | Synonym block |
Each line is composed of a number of fields. Fields are separated by a semicolon and are defined by their position. The first field of each line is the line type character (F, E, P, V, D, L, R, X, or S, as given above). The remaining fields depend upon the type of the line. For example, the second entry on an F-line is a main entry word, the fifth field has hyphenation information, and the seventh has part of speech information.
Figure 3 shows the contents of the fields for each line type. This report discusses each line and field in more detail.
F lines | Main entry |
F1 | F |
F2 | Main entry |
F3 | Homograph Number |
F4 | Prefix/Suffix/Infix |
F5 | Hyphenation |
F6 | Part of Speech |
F7 | Part of Speech Joiner |
F8 | Secondary Part of Speech |
E lines | Etymology |
E1 | E |
E2 | text |
P lines | Pronunciation |
P1 | P |
P2 | text |
V lines | Variants |
V1 | V |
V2 | Variant Word |
V3 | Hyphenation |
V4 | Variant Level |
D lines | Definitions |
D1 | D |
D2 | Sense number |
D3 | Sense letter |
D4 | Sense subnumber |
D5 | Part of Speech |
D6 | Text definition |
R cards | Related Words |
R1 | R |
R2 | Related Word or Phrase |
R3 | Hyphenation |
R4 | Part of Speech |
R5 | Part of Speech Joiner |
R6 | Secondary Part of Speech |
X lines | Cross Reference |
X1 | X |
X2 | Word |
X3 | superscript, if any |
X4 | subscript, if any |
X5 | Type of cross reference |
X6 | Secondary word |
L lines | Labels |
L1 | L |
L2 | Sense Number |
L3 | Sense Letter |
L4 | Sense Subnumber |
L5 | Label Text |
S lines | Synonym Block |
S1 | S |
S2 | Synonym number |
S3 | text |
Some lines, particularly definitions and synonym blocks are quite long and hence it is difficult to fit them on one 80 character line. Therefore, lines are split whenever necessary so that no physical line exceeds 80 characters. Lines are always split at a blank, and the incomplete line is terminated with a sharp or hash mark character ('#'). When processing the dictionary, if a line is terminated with a # character, then the # character should be replaced by a blank and the next physical line should be read in and appended to the previous line.
A major problem with the dictionary is its character set. First, the dictionary publisher did not feel constrained in his use of characters, but choose whatever symbols best fit his purpose. Second, the dictionary was originally encoded in an extended BCD (for the Q-32 computer), then translated into EBCDIC (for the IBM 360/370) and now has been translated into ASCII. None of these character sets is completely compatible with the others, nor is any of them sufficient to represent all of the variation found in the original printed dictionary. Hence an encoding scheme must be used to expand the set of representable characters.
This expansion occurs in two independent directions: font information and special characters.
Font information is represented by use of the square brackets in ASCII to surround any special font material. Five font types are recognized: (1) italic, (2) MINI-CAPS, (3) bold, (4) subscripts, and (5) superscripts. Each is denoted by an identifying keyword immediately after the opening left square bracket, followed by a space, followed by the material to be in the defined font, followed by the closing right square bracket. For example, an italic was is represented as "[italic was]", while a mini-caps AMBIENT is "[mini AMBIENT]" and a bold syn is "[bold syn]". Superscripts and subscripts may be italic, mini-caps, or bold, and a few superscripted superscripts also occur, as in "6.24 {times} 10 [sup 10 [sup 10]]".
Frequency | Keyword | Meaning |
100,927 | italic | italic |
61,883 | mini | mini caps |
2,171 | sub | subscript |
4,520 | sup | superscript |
The dictionary includes a large number of special symbols which are not representable in ASCII. These include all the Greek alphabet, the Hebrew alphabet, and many miscellaneous other special symbols. All special symbols which are not available in ASCII (and some that are) have been given names which are represented by the name encased in braces, as {degrees} (for a degree symbol) {times} (for multiplication represented by a small x), {tau} (for the lower-case greek letter tau), and so on. A complete list of these special symbols is in Appendix II.
Each symbol name has been selected to exclude embedded blanks. Thus all characters between an opening right brace and its closing right brace are non-blank. Certain characters which are in ASCII (braces, brackets, question mark, exclamation mark, and so on) have also been represented in this way because it allowed them to be used for other purposes (such as font and special character representation), and they occurred only infrequently (less than 100 times).
While processing W7 both to understand its contents and to put those contents into a usable form, we encountered a large number of errors. These errors were of several types:
Another form of error analysis was an attempt to find typographical errors. The approach was simple: we extracted a list of all unique words used in the dictionary definitions. This produced a list of 54,298 words. We compared this list with the list of all words defined in the dictionary (main entries, variants, and related words). This reduced our list to 20,292 words that were used in definitions, but not defined. Many of these were derived forms of defined words: past tense, plurals, and so on. Doing some simple suffix analysis, we were left with about 8,000 words. Most of these were apparently Greek or Latin botanical or zoological names. Deleting those ending in -ia or -ae and all words in italics in the dictionary left a list of 2,821 words.
These were checked by hand to produce a list of 903 incorrectly spelled words (typographical errors in keying the dictionary into the computer). We also found 54 words which were used, but not defined:
Australasian | bloomery | broadheaded | clothesline |
crossbeam | darkskinned | dinnerware | doorbell |
entranceway | equivalve | etc | fieldworks |
flattish | foreseen | foretold | fourpence |
gunstock | hairdress | hindlimbs | homeward |
hyperactivity | leftward | lightcolored | longlegged |
lowcut | messroom | mr. | nailhead |
neckband | noctuid | nubby | parimutuel |
partaken | pregenital | pyrotechny | rangeland |
rosebush | sawteeth | seneschal | sheepdogs |
shorthaired | sightsee | snowstorm | songbook |
spinymargined | spondumene | sulphates | supersensitized |
tv | twelvefold | understock | upcurved |
valency | workpiece. |
We also found a smaller list of words with typographical errors in the main entry in the computer files.
Of the 903 typographical errors, 543 were the result of a missing blank between two words. Of the remaining 360, 34% were a missing letter, 27% were a wrong letter, 20% were an extra letter, and 13% were the result of transposed letters. The remaining errors were caused by two extra or two missing letters, or by transposing two letters around a third. The middle letter in this case was always a vowel. (For example, min would be typed nim.)
We also found cases of typographical errors in the original printed dictionary. These are
[barranca] | gulley => gully |
[bitch] | doublecross => double-cross |
[capsicum] | genu => genus |
[drift] | quantitive => quantitative |
[fornication] | NCE => New Catholic Edition |
[kid] | goodhumored => good-humored |
[lapse] | apostasize => apostatize |
[lycopodium] | clubmosses => club mosses |
[type species] | permanenty => permanently |
[vanity] | knicknack => knickknack |
[select] | mismatched parenthesis |
[terpineol] | mismatched parenthesis |
[tapa] | an Hawaiian (but see 'a Hawaiian' for [luau] and [poi]) |
[pay] | REQUITE in syn not defined |
[rude] | ROUGH in syn not defined |
[Lag b'Omer] | 33d => 33rd |
[one] | 3d-person => 3rd-person |
It was interesting to follow these errors through the various printings and editions of the Merriam-Webster dictionaries. Four errors were corrected in the 1970 printing of W7, one in the 1971 printing of W7, and one in the 1975 printing of the New Collegiate Dictionary. At least three errors remain in the 1983 printing of the Ninth New Collegiate ([bitch], [drift], [vanity]).
Additional error checking is continuing. Jonathan Slocum suggested checking that main entries are in alphabetical order. This revealed 41 more errors including the following errors in the printed version:
cerate | cerat- |
chalk up | chalkstone |
ster | stepwise |
unchangeable | unchange |
Many words include a cross-reference to a group of synonyms. For example, under compact, is the reference "[bold syn] see [mini CLOSE]", and close is followed by a synonym block. The following words make reference to non-existent synonym blocks. That is, under corral we have the entry "[bold syn] see [mini ENCLOSE]", but there is no synonym block for enclose.
corral | [bold syn] see enclose |
parable | [bold syn] see allegory |
put | [bold syn] see set |
put up | [bold syn] see reside |
rebellious | [bold syn] see insubordinate |
reputation | [bold syn] see fame |
sidestep | [bold syn] see dodge |
squirm | [bold syn] see writhe |
superfluity | [bold syn] see excess |
surplusage | [bold syn] see excess |
tickle | [bold syn] see please |
time | [bold syn] see opportunity |
transgression | [bold syn] see breach |
urgent | [bold syn] see pressing |
vault | [bold syn] see jump |
This field occurs 68,764 times and is always F. It signals the start of a new set of cards related to one dictionary entry.
This field contains a main entry, a word or phrase, which is defined in the dictionary. There are 60099 unique values for this field. Generally, this is a lower-case word, but entries include,
This field is normally empty, but may contain a number (from 1 to 9) representing a homograph number. The following list indicates the number of entries with each homograph number. An entry with a maximum homograph of 5 will have entries with homograph numbers of 1, 2, 3, 4, and 5.
Frequency | Homograph number |
6538 | 1 |
6542 | 2 |
1427 | 3 |
475 | 4 |
164 | 5 |
54 | 6 |
17 | 7 |
5 | 8 |
3 | 9 |
The two entries with exactly 8 homographs are bob and like; the three entries with 9 homographs are list, rack, and tip.
For prefix and suffix entries, this field has either a p (for prefix), s (for suffix) or i (for infix). Normally, it is empty.
Frequency | Code | Meaning |
663 | p | prefix |
444 | s | suffix |
2 | i | infix |
The two entries with an i value are -i- and -o-.
This field contains hyphenation information. This is a sequence of one-digit numbers giving the number of characters between possible hyphenation points. As an example, a word such as estimate is hyphenated as es-ti-mate, and would be encoded as 22. The distance from the start of the word to the first hyphenation point is two, and so is the next syllable. The word ethnological, hyphenated as eth-no-log-i-cal, is encoded as 3231.
The encoding of the distance between hyphenation points is one digit, from 1 to 9. If the distance exceeds 9, we continue with the upper case letters (as with hexadecimal). Thus A is 10, B is 11, C is 12, ..., Z is 35.
There are 773 distinct values for this field. The most common entries are:
Frequency | Hyphenation encoding |
6777 | 3 |
4990 | 4 |
3924 | 2 |
2471 | 32 |
2148 | 23 |
1935 | 5 |
1875 | 33 |
1813 | 22 |
985 | 31 |
804 | 42 |
The frequency of occurrence for each of the individual characters is:
Frequency | Hyphenation Distance Code |
467 | 1 |
1142 | 2 |
883 | 3 |
350 | 4 |
129 | 5 |
57 | 6 |
52 | 7 |
38 | 8 |
19 | 9 |
8 | A |
10 | B |
6 | C |
4 | D |
3 | E |
2 | F |
1 | G |
1 | I |
The longest hyphenation encoding is
pneumonoultramicroscopicsilicovolcanoconiosis;4222323423123222213 pneu-mo-no-ul-tra-mi-cro-scop-ic-sil-i-co-vol-ca-no-co-ni-o-sis
This field identifies the primary part of speech of the main entry. The part of speech is encoded in a two-character mnemonic abbreviation. The parts of speech codes and their frequency are listed below.
Frequency | Code | Part of Speech | Example |
42590 | n | noun | |
13250 | aj | adjective | |
4972 | vt | verb, transitive | |
2423 | vb | verb | |
1432 | av | adverb | |
1363 | vi | verb, intransitive | |
516 | cf | combining form | |
164 | pp | preposition | |
150 | nc | noun combining form | |
121 | tm | trademark | |
108 | pn | pronoun | |
107 | ns | noun suffix | |
96 | cj | conjunction | |
94 | ij | interjection | |
74 | pf | prefix | |
51 | js | adjective suffix | |
12 | vs | verb suffix | |
10 | vp | verb imperative | |
6 | as | adjective suffix | |
4 | va | verbal auxiliary | |
3 | ia | indefinite article | |
2 | vm | verb impersonal, past | meseems, methinks |
2 | vc | verb combining form | -lyze, -sect |
2 | sf | suffix | -est, -fold |
2 | da | definite article | the, ye |
1 | np | noun plural suffix | -s |
1 | is | interjection suffix | -o |
For those cases where two parts of speech are given for a single main entry (most separate parts of speech rate separate main entry listings), this field indicates how the part of speech given in field F8 is related to the part of speech of field F6.
Frequency | Code | Relationship |
48 | 1 | Equal |
205 | 2 | Secondary |
When a secondary part of speech is given, it is listed in field F8. The relationship between this part of speech and the part of speech listed in field F6 is either equal or secondary, and is indicated by the contents of field F7.
Frequency | Code | Secondary Part of Speech | Example |
182 | aj | adjective | |
34 | av | adverb | |
23 | n | noun | |
3 | vb | verb | |
2 | vt | verb, transitive | |
2 | js | adjective suffix | |
2 | cj | conjunction | |
2 | as | adjective suffix | |
1 | vi | verb, intransitive | bike |
1 | pp | preposition | betwixt |
1 | ps | pronoun suffix | -'s |
If you look at the number and combination of F6 and F8 fields, we find the following chart of pairs with more than one occurrence:
Frequency | F6 | F8 | Example |
171 | av | aj | OK |
34 | aj | av | ago |
19 | aj | n | ain |
7 | n | aj | Bolshevist |
3 | pn | aj | tother |
3 | n | vb | dupe |
2 | vs | js | -ed |
2 | n | vt | escarp |
2 | js | as | -er |
2 | av | cj | afore |
1 | vt | n | bield |
1 | vi | n | parle |
1 | vb | n | craunch |
1 | ns | pn | -'s |
1 | nc | aj | -chrome |
1 | n | vi | bike |
1 | av | pp | betwixt |
The E cards specify the linguistic derivation of words.
Frequency | Code |
30673 | E |
There are 26943 distinct values for this field. There appear to be internal semicolons (at least 4906 of them) which do not signify field boundaries. No analysis has yet been done of this field. I suspect that it can and should be reorganized to make this information easier to use.
Frequency | Most Frequent Values | Meaning |
987 | ISV | International Scientific Vocabulary |
535 | origin unknown | |
376 | NL | New Latin |
208 | F | French |
158 | L | Latin |
120 | imit. | imitative |
117 | ME | Middle English |
94 | ME, fr. OE | Middle English, from Old English |
79 | MF or L | Middle French or Latin |
49 | It | Italian |
The P cards specify the pronunciation of the preceding F, R, or V card.
Frequency | Code | Meaning |
66987 | P | Pronunciation |
This field has 56720 distinct values, and have basically been transcribed without interpretation or formatting, other than for special characters. Again, as with field E2, there appear to be internal semicolons (496 at least) and this field should be considered for possible reorganization.
There are 9959 V cards, specifying variant spellings for the preceding main entries.
Frequency | Code | Meaning |
9959 | V | Variant spelling |
This field contains the variant word or phrase. There are 9436 unique values for this entry. Again, as with field F2, there are entries with commas, apostrophes, hyphens, special characters, and subscripts.
The hyphenation encoding is as described for field F5. The syllable lengths run from 1 to 9, and also include a few entries of A, B, C, D, and F. The F value is for jack-in-the-boxes. The longest entries (onomatopoetically;2142213 and thermoperiodicity;4222131) are 7 digits in length. There are 262 unique values for this field.
This field encodes the relationship of the variant to its main entry. The encoding is a two-digit code. The first digit indicates the entry of which this is a variant. A "0" indicates that this is a variant of a main entry; a "1" indicates that this is a variant of the first inflected form (such as the comparative form of an adjective), and so on. The second digit indicates (with a "1") if the two forms are equally acceptable alternates, or (with a "2") if this variant form is a secondary variant.
Frequency | Variant Level | Meaning | Example |
5093 | 11 | irregular plural | ganglia |
2463 | 01 | variant spelling | intortion |
1228 | 21 | present participle | chagrining |
660 | 02 | secondary spelling | bivalved |
323 | 12 | birle under birl | |
99 | 31 | were under be | |
69 | 22 | bayoneting under bayonet | |
11 | 32 | awaking, cleaving, thriving | |
6 | 41 | flied, goes, up | |
1 | 51 | be | |
1 | 61 | are | |
1 | 71 | is | |
1 | 81 | are | |
1 | 91 | am | |
1 | A1 | being | |
1 | B1 | been |
It is not clear that this field is consistently or correctly defined.
This field introduces the definitions for the preceding main entry cards.
Frequency | Code | Meaning |
140501 | D | Definition |
Many words have many different meanings, or senses. Each different sense is given a sense number: the first sense is sense 1, and so on. Where an entry has only one sense, it is either sense zero or sense one.
Frequency | Sense Number | Example |
46190 | 0 | |
36417 | 1 | |
33870 | 2 | |
11956 | 3 | |
5204 | 4 | |
2682 | 5 | |
1559 | 6 | |
872 | 7 | |
532 | 8 | |
335 | 9 | |
233 | 10 | |
183 | 11 | |
143 | 12 | |
97 | 13 | |
66 | 14 | |
47 | 15 | |
29 | 16 | |
22 | 17 | |
19 | 18 | |
13 | 19 | |
9 | 21 | |
5 | 20 | |
5 | 22 | |
5 | 23 | |
4 | 24 | |
1 | 25 | set |
1 | 26 | set |
In addition to having multiple senses, some words have significant variations in meaning within a given sense. To distinguish these subsenses, they are given a sense letter in addition to a sense number. Thus, we can refer to sense 3a as well as sense 3b and 3c, where appropriate.
Frequency | Sense Letter | Example |
16660 | a | |
16580 | b | |
4307 | c | |
1214 | d | |
409 | e | |
153 | f | |
65 | g | |
30 | h | |
12 | i | |
10 | j | |
4 | l | |
4 | k | |
2 | n | call |
1 | m | call |
Finally, in addition to having senses and subsenses, there are cases where we have sub-sub-senses. These are indicated by a subsense number. To completely distinguish these case, one must give a sense number, a sense letter, and the subsense number (as in definition 3b1).
Frequency | Sense Subnumber | Example |
1739 | 2 | |
1737 | 1 | |
278 | 3 | |
69 | 4 | |
19 | 5 | |
3 | 6 | |
2 | 7 | good, stone |
1 | 8 | stone |
Although the main entry card (F-card) indicated the part of speech for each entry, we actually could have had two parts of speech on the F-card: the part of speech (F6) and the secondary part of speech (F8). The D5 field specifies the part of speech for this particular definition. This will normally be one of the two parts of speech from the F-card. However, if the main part of speech on the F card was a verb (vb), then this field may further classify the verb as either a transitive (vt) or intransitive (vi) verb, an auxiliary verb (va), or an imperative verb (vp).
Notice that only the three entries brew (sense 1), lengthen, and tamper use the vb part of speech for the definition; all other entries use a more specific part of speech.
Frequency | Part of Speech | Meaning |
80022 | n | noun |
26084 | aj | adjective |
19408 | vt | verb, transitive |
8883 | vi | verb, intransitive |
2320 | av | adverb |
863 | cf | combining form |
552 | pp | preposition |
372 | vb | verb |
225 | pn | pronoun |
225 | pf | prefix |
223 | ns | noun suffix |
200 | cj | conjunction |
188 | nc | noun combining form |
128 | tm | trademark |
98 | ij | interjection |
89 | js | adjective suffix |
65 | va | verbal auxiliary |
26 | vs | verb suffix |
21 | da | definite article |
11 | vp | verb imperative |
8 | as | adjective suffix |
7 | ia | indefinite article |
4 | sf | suffix |
2 | vm | verb impersonal, past |
2 | vc | verb combining form |
2 | np | noun plural suffix |
1 | is | interjection suffix |
The remainder of the D-card is the definition of the indicated sense
of the dictionary entry. The definitions were transcribed
directly from the printed text.
There are
135,973 unique values for this field. It appears that some (at
least 7656 of them) of these values contain embedded semi-colons.
Semi-colons after Field D6 are part of the dictionary text, and not
field delimiters.
The R cards specify words or phrases which are related to the main entry.
Frequency | Code | Meaning |
19123 | R | Related words |
The related word entry is similar to the F2 and V2 fields. There are 19049 distinct values for this field. We have the same sort of special cases:
Hyphenation information is encoded as with fields F5 and V3. The hyphenation spans run from 1 to 9, with a few having values of A, B, C, D, and G. The G value is for will-o'-the-wispish. The longest field for R3 is for interdenominationalism;23231242. There are 845 distinct values of this field.
This field gives the part of speech of the indicated related word, similar to Field F6.
Frequency | Code | Part of Speech | Example |
7981 | n | noun | |
5258 | aj | adjective | |
4838 | av | adverb | |
185 | vt | verb, transitive | |
102 | vb | verb | |
96 | vi | verb, intransitive | |
2 | nc | noun combining form | |
2 | ac | adjective combining form | -cratic |
1 | pp | preposition | midst |
Note that the ac code is not used in Field F6.
This field is similar to Field F7, and indicates the relationship between fields R4 and R6.
Frequency | Code | Part of Speech Joiner |
340 | 1 | Equal |
6 | 2 | Secondary |
The six fields with a value of 2 are related to AWOL, crescendo, darn, half hour, pitter-patter, and slant.
This field is similar to Field F8.
Frequency | Code | Secondary Part of Speech | Example |
178 | n | noun | |
112 | aj | adjective | |
34 | pn | pronoun | |
20 | av | adverb | |
1 | vb | verb | cross-reference |
1 | pp | preposition | mid |
The X card indicates a cross reference between the previous main entry (F-card) and the given word.
Frequency | Code | Meaning |
4596 | X | Cross reference |
The X2 field is similar to the F2, R2, and V2 fields. The entries may contain capital letters, blanks, hyphens, and some special characters (including periods, parenthesis, numbers, and typesetting codes). For each cross reference, the type of cross reference is encoded in Field X5. Fields X3 and X4 help to specify exactly to which of the possibly multiple senses of the cross-referenced word reference is being made.
There are 1841 unique entries for Field X2.
If the cross-reference word has homographs, we need to know which is being referred to. This entry matches the F3 value.
Frequency | Superscript | Example |
3 | 1 | his, herself, -s |
1 | 3 | squabble |
Once a cross-reference entry has been defined, it may be important to specify which of the possibly multiple senses is being referred to. Following the word may be a sense specifier. This can indicate simply a sense number, or a sense number and sense letter. This seems particularly important when a comparison is being made. For example, consider the cross reference to base under alkali or to meta-, orth-, and para- under benzene ring.
Frequency | Subscript | Example |
5 | 1 | |
2 | 1a | |
2 | 1b | |
1 | 2 | |
2 | 2a | |
2 | 2b | |
2 | 3 | |
2 | 3b | |
1 | 4b | |
1 | 7 |
The original report from SDC [2] indicates that there may be 11 different values for this field; we found that only the following 8 actually occur.
Frequency | Code | Meaning | Example |
2996 | 9 | [bold syn] see <X2> | averse |
571 | 6 | called also <X2> | dipper |
393 | 4 | compare <X2> | averse |
389 | 1 | see *** table | aluminum |
103 | 0 | see <X2> | anth- |
90 | 8 | [bold syn] see in addition <X2> | abandon |
53 | 3 | see <X6> at [mini MONEY] table | agora |
1 | 5 | compare [mini ELEMENT] table | rare earth |
The symbol <X2> means the contents of field X2 on this card; the symbol <X6> means the contents of field X6. The *** of code 1 can stand for money or element (at least) and is not recorded.
As with some of the other fields, it is not clear that this field has been consistently applied. There are still some occurrences of called also in the definitions, although these are generally scattered throughout only those words starting with an a, b, or c. We found 112 occurrences of called also [italic ...] in the files. See, for example, the entries for Haitian, hermorrhoid, hydroxide ion, and road.
Each cross reference with an encoding of 3 in Field X5 is of the form See <X6> at [mini MONEY] table, where <X6> is some form of money. This field contains the form of money. The most common form is pound (8 occurrences), followed by four occurrences of dinar and two occurrences each of escudo, franc, krona, rupee, and yuan. There is one occurrence of each of afghani, baht, balboa, bolivar, colon, cruzeiro, deutsche mark, drachma, forint, gourde, guarani, hwan, kip, koruna, krone, kyat, lek, leu, lev, lira, markka, peseta, peso, riel, riyal, ruble, rupiah, schilling, somalo, yen, and zloty.
At various points in an entry, a label may appear. The L-cards transcribe these labels
Frequency | Code | Meaning |
11992 | L | Label |
To specify the entry which is being labeled, we must specify the sense number, letter, and subnumber as appropriate. In general, it is expected that the definition being labeled would follow the label, but this may not always be the case.
Frequency | Sense Number | Example |
2271 | 1 | |
1935 | 2 | |
1317 | 0 | |
658 | 3 | |
278 | 4 | |
148 | 5 | |
99 | 6 | |
50 | 7 | |
32 | 8 | |
20 | 9 | |
9 | 12 | |
9 | 11 | |
9 | 10 | |
7 | 13 | |
4 | 14 | |
3 | 15 | |
1 | 19 | life |
1 | 21 | set |
1 | 24 | strike |
This field is similar to Field D3.
Frequency | Sense Letter | Example |
700 | b | |
560 | a | |
211 | c | |
52 | d | |
12 | e | |
4 | f | |
3 | h | dead, matter, what |
1 | g | matter |
This field is similar to Field D4.
Frequency | Subsense Number | Example |
71 | 2 | |
49 | 1 | |
8 | 3 | |
3 | 4 | number, objective, rate |
The label itself is a short phrase indicating some special property of the entry. Below are the most common properties listed in the labels. These are often combined, as in chiefly Scot, often cap M, and sing but pl in constr. There are 1494 distinct values.
Frequency | Label Text | Meaning |
3051 | [italic pl] | plural |
2040 | [italic archaic] | |
1108 | [italic obs] | obsolete |
534 | [italic often attrib] | attribute |
465 | [italic often cap] | capitalized |
392 | [italic cap] | capitalized |
375 | [italic slang] | |
307 | [italic chiefly Brit] | British |
293 | [italic Brit] | British |
289 | [italic chiefly Scot] | Scottish |
222 | [italic pl but sing or pl in constr] | plural but singular or plural in construction |
183 | [italic chiefly dial] | dialect |
154 | [italic Scot] | Scottish |
103 | [italic dial] | dialect |
94 | [italic pl but sing in constr] | plural but singular in construction |
87 | [italic dial Brit] | British dialect |
39 | [italic dial Eng] | English dialect |
32 | [italic often not cap] | often not capitalized |
28 | [italic often cap], [italic often attrib] | |
26 | [italic not cap] |
The S-cards introduce a synonym block. Each synonym block is a short paragraph consisting of two parts. The first part gives a list of words which are synonyms. The second part gives information about the differences between the words. We have encoded these two parts in the S2 field. The list of synonyms is given in the first field, with an S2 value of 0. The remaining S cards have S2 values starting at 1 and increasing by one for each explanation, as in the following example, which follows dark.
S;0;[mini DIM], [mini DUSKY], [mini MURKY], [mini GLOOMY]: S;1;[mini DARK], the general term, implies utter or virtual lack of# illumination; S;2;[mini DIM] suggests too weak a light for things to be seen clearly or# distinctly; S;3;[mini DUSKY] suggests deep twilight and close approach to darkness; S;4;[mini MURKY] implies a heavy darkness such as that caused by smoke, fog,# or dust in air or mud in water; S;5;[mini GLOOMY] implies the serious interference with the normal radiation# of daylight caused by dense clouds and connotes cheerlessness and pessimismThe list on the S;0; cards may or may not include the main entry which the synonym block follows. That is, notice that the above example synonym block follows the entry for dark, and includes dark on the S;1; card. However, dark is not listed on the S;0; card. The lists were transcribe directly from the printed copy. Of the 835 S cards, 331 have a list without the main entry. The remaining 502 cards have a complete list, including (as the first word in the list) the main entry.
Frequency | Code | Meaning |
835 | S | Synonym card |
There is basically one S card for each word in the S;0 list. However, some entries (like alone) will merge two words into one S card. Other entries (like convert) have more than one entry per list word.
Most of these fields start with a mini-caps word, although there are some (like cloister) where entries begin with an article (a or an). Again there are internal semicolons, which should not be treated as field delimiters.
The following is a list and explanation of the various special characters defined using the brace notation. For symbols which are not in the ASCII character set, or those which occur only infrequently, we have selected a name for the character. The name is enclosed in braces, and should be interpreted as a single character.
The first group of characters are those alphabetic characters which may have diacritical marks, such as a grave accent, accute accent, umlaut, carat, or tilda. These are encoded by placing the alphabetic character and its accent mark in braces.
Below we give the symbol, the number of times that it occurs, and an entry word giving an example. Notice that in come cases, such as the entry for {C,}, the symbol is used not in the example (compendium), but in the definition of the example.
Special codes are used in fields F2, E2, P2, V2, D6, R2, L5, and S2.
Frequency | Special Code | Example Word | Usage |
2 | {'} | [Dhu'l-Qa`dah] | E;Ar [italic Dh{u-}-l-qa{'}dah], lit., ... |
61 | {a'} | [Aymara] | E;Sp [italic aymar{a'}] |
10415 | {a-} | [a] | E;ME, fr. OE [italic {a-}n] one -- more at [mini ONE] |
7 | {A-} | [Ab] | E;Heb [italic {A-}bh] |
135 | {a.} | [horselaugh] | P;'h{o.}r-.slaf, -.sl{a.}f |
11752 | {a:} | [Aaronic] | P;a-'r{a:}n-ik, e- |
206 | {ae-} | [abet] | E; akin to OE [italic b{ae-}tan] to bait |
2 | {AE} | [Aegir] | E;ON [italic {AE}gir] |
303 | {ae} | [acorn] | E;ME [italic akern], fr. OE [italic {ae}cern] |
15 | {a^} | [ch{a^}teau] | F;ch{a^}teau |
1 | {A`} | [vis-{a`}-vis] | D;[mini T{E^}TE-{A`}-T{E^}TE] |
42 | {a`} | [gardyloo] | E;perh. fr. F [italic garde {a`} l'eau{exclamation-point}] ... |
50 | {a~} | [Bisayan] | E;Bisayan [italic Bisay{a~}] |
1 | {c'} | [Moho] | E;Andrija [italic Mohorovi{c-hachek}i{c'} fl] 1908 ... |
3 | {C,} | [langue d'oc] | D;[mini PROVEN{C,}AL] |
57 | {c,} | [aper{c,}u] | F;aper{c,}u |
1012 | {e'} | [-ability] | E;..., fr. MF [italic -abilit{e'}], ... |
18 | {E'} | [Beaum{e'}] | L;[italic var of] [mini BAUM{E'}] |
11 | {E-} | [Eden] | E;LL, fr. Heb '[italic {E-}dhen] |
24706 | {e-} | [ABC] | P;.{a-}-(.)b{e-}-'s{e-} |
11 | {e.} | [hoop] | E;..., Lith [italic kab{e.}] hook |
13 | {e:} | [Chlo{e:}] | F;Chlo{e:} |
26 | {e^} | [ar{e^}te] | F;ar{e^}te |
2 | {E^} | [vis-{a`}-vis] | D;[mini T{E^}TE-{A`}-T{E^}TE] |
190 | {e`} | [ampere] | E;Andre M. [italic Amp{e`}re] {dagger}1836 ... |
3 | {E`} | [calash] | D;[mini CAL{E`}CHE] 2 |
75208 | {e} | [Aaron] | P;'ar-{e}n, 'er- |
1 | {E~} | [arris] | E;... -- more at [mini AR{E~}TE] |
23 | {e~} | [-ese] | E;Pg [italic -{e~}s] & It [italic -ese], fr. L [italic -ensis] |
29 | {i'} | [agouti] | E;F, fr. Sp [italic agut{i'}], fr. Guarani |
1 | {I-} | [Jezebel] | E;Heb [italic {I-}zebhel] |
9224 | {i-} | [Abib] | E;Heb [italic {A-}bh{i-}bh], lit., ear of grain |
59 | {i:} | [abbey] | E;ME, fr. OF [italic aba{i:}e], ... |
5 | {I:} | [natural] | S;[bold syn] [mini INGENUOUS], [mini NA{I:}VE], ... |
1 | {I^} | [ma{i^}tre d'] | D;[mini MA{I^}TRE D'H{O^}TEL] |
5 | {i^} | [ma{i^}tre d'] | F;ma{i^}tre d';;;3;n;; |
2 | {i`} | [lame] | E;..., Lith [italic l{i`}mti] to break down |
8 | {i~} | [Cistercian] | E;ML [italic Cistercium] C{i~}teaux |
59 | {k_} | [Alberich] | P;'{a:}l-b{e}-ri{k_} |
6 | {m.} | [Sankhya] | E;Skt [italic s{a-}{m.}kyha], lit., based on calculation |
1 | {n,} | [friction] | E;..., Skt [italic bhr{i-}{n,}anti] they injure |
21 | {n.} | [junk] | E;Pg [italic junco], fr. Jav [italic jo{n.}] |
1 | {n:} | [pogy] | E; akin to Abnaki [italic p8ka{n:}gan] menhaden |
3841 | {ng} | [Aberdeen Angus] | P;.ab-{e}r-.d{e-}-'na{ng}-g{e}s |
60 | {n~} | [balboa] | E;Sp, fr. Vasco N{u'}{n~}ez de [italic Balboa] ... |
11138 | {o-} | [Aaron] | E;LL, fr. Gk [italic Aar{o-}n], ... |
7 | {O-} | [Obadiah] | E;Heb [italic '{O-}bhadhy{a-}h] |
6381 | {o.} | [holocaust] | P;'h{a:}l-{e}-.k{o.}st, 'h{o-}-l{e}- |
93 | {o:} | [acanth-] | E; akin to ON [italic {o:}gn] awn ... |
1 | {oe} | [diphthong] | D;a form of the ligature {ae} or {oe} |
3 | {O^} | [dinner] | D;[mini TABLE D'H{O^}TE] 2 |
5 | {o^} | [entrep{o^}t] | F;entrep{o^}t |
50 | {o`} | [Oneida] | E;Iroquois [italic On{e-macron-tilda}y{o`}de'], ... |
21 | {o~} | [alcove] | E;F [italic alc{o~}ve], fr. Sp [italic alcoba], ... |
5 | {r_} | [betel] | E;Pg, fr. Tamil [italic ve{r_}{r_}lai] |
6 | {s'} | [Israel] | E;ME, ..., fr. Heb [italic Yi{s'}r{a-}'{e-}l] |
7 | {S'} | [Saiva] | E;Skt [italic {S'}aiva], fr. [italic {S'}iva] Siva |
7 | {s,} | [bashi-bazouk] | E;Turk [italic ba{s,}i bozuk] irregular soldier |
45 | {s`} | [ashram] | E;Skt [italic {a-}{s`}rama], ... |
533 | {th_} | [airworthiness] | P;-.w{e}r-{th_}{e-}-n{e}s |
27 | {u'} | [Chinook] | E;Chehalis [italic Tsin{u'}k] |
1 | {U-} | [Uriel] | E;Heb [italic {U-}r{i-}'{e-}l] |
465 | {u-} | [Algol] | E;Ar [italic al-gh{u-}l], lit., the ghoul |
3337 | {u.} | [aardwolf] | P;-.w{u.}lf |
1 | {U:} | [Geiger counter] | D;[mini GEIGER-M{U:}LER TUBE] |
4376 | {u:} | [abduce] | P;ab-'d(y){u:}s |
1 | {ue-} | [poilu] | P;pw{a:}l-'(y){u:}, 'pw{a:}l-., pw{a.}-l{ue-} |
4 | {u`} | [Mohawk] | E; akin to Narraganset [italic Mohowa{u`}uck] |
13 | {u~} | [Nebuchadnezzar] | E;..., modif. of Bab [italic Nab{u~}-kudurri-u{s-sub-dot}ur] |
79 | {y-} | [balefire] | E;ME, fr. OE [italic b{ae-}lf{y-}r] funeral fire, ... |
Frequency | Special Code | Example Word | Usage |
1 | {ALPHA} | [alpha] | D;the first letter of the Greek alphabet -- symbol {ALPHA} or {alpha} |
5 | {alpha} | [Altair] | D;the first magnitude star Alpha ({alpha}) Aquilae |
1 | {BETA} | [beta] | D;the second letter of the Greek alphabet -- symbol {BETA} or {beta} |
2 | {beta} | [beta] | D;the second letter of the Greek alphabet -- symbol {BETA} or {beta} |
1 | {CHI} | [chi] | D;the 22nd letter of the Greek alphabet -- symbol {CHI} or {chi} |
3 | {chi} | [chi] | D;the 22nd letter of the Greek alphabet -- symbol {CHI} or {chi} |
2 | {delta} | [delta] | D;the fourth letter of the Greek alphabet -- symbol {DELTA} or {delta} |
3 | {DELTA} | [delta] | D;an increment of a variable -- symbol {DELTA} |
1 | {EPSILON} | [epsilon] | D;the 5th letter of the Greek alphabet -- symbol {EPSILON} or {epsilon} |
2 | {epsilon} | [epsilon] | D;the 5th letter of the Greek alphabet -- symbol {EPSILON} or {epsilon} |
1 | {eta} | [eta] | D;the 7th letter of the Greek alphabet -- symbol {ETA} or {eta} |
3 | {ETA} | [IHS] | E;LL, part transliteration of Gk {IOTA}{ETA}{SIGMA}, ... |
1 | {GAMMA} | [gamma] | D;the third letter of the Greek alphabet -- symbol {GAMMA} ... |
3 | {gamma} | [gamma] | D;the third letter of the Greek alphabet -- symbol ... {gamma} |
2 | {iota} | [iota] | D;the 9th letter of the Greek alphabet -- symbol {IOTA} or {iota} |
4 | {IOTA} | [IHS] | E;LL, part transliteration of Gk {IOTA}{ETA}{SIGMA}, ... |
1 | {KAPPA} | [kappa] | D;the 10th letter of the Greek alphabet -- symbol {KAPPA} or {kappa} |
3 | {kappa} | [gamma nasal] | D;gamma pronounced \\{ng}\\, before {kappa}, {chi}, {xi}, or ... |
1 | {LAMBDA} | [lambda] | D;the 11th letter of the Greek alphabet -- symbol {LAMBDA} ... |
1 | {lambda} | [lambda] | D;the 11th letter of the Greek alphabet -- symbol ... {lambda} |
1 | {MU} | [mu] | D;the 12th letter of the Greek alphabet -- symbol {MU} or {mu} |
1 | {mu} | [mu] | D;the 12th letter of the Greek alphabet -- symbol {MU} or {mu} |
1 | {NU} | [nu] | D;the 13th letter of the Greek alphabet -- symbol {NU} or {nu} |
2 | {nu} | [nu] | D;the 13th letter of the Greek alphabet -- symbol {NU} or {nu} |
1 | {OMEGA} | [omega] | D;the 24th letter of the Greek alphabet -- symbol {OMEGA} or {omega} |
3 | {omega} | [omega] | D;the 24th letter of the Greek alphabet -- symbol {OMEGA} or {omega} |
1 | {omicron} | [omicron] | D;the 15th letter ... -- symbol {OMICRON} or {omicron} |
2 | {OMICRON} | [IHS] | E;LL, ... {IOTA}{ETA}{SIGMA}{OMICRON}{UPSILON}{SIGMA} ... |
1 | {PHI} | [phi] | D;the 21st letter of the Greek alphabet -- symbol {PHI} or {phi} |
1 | {phi} | [phi] | D;the 21st letter of the Greek alphabet -- symbol {PHI} or {phi} |
1 | {PI} | [pi] | D;the 16th letter of the Greek alphabet -- symbol {PI} or {pi} |
6 | {pi} | [gilbert] | D;the cgs unit ... equivalent to 10{division-symbol}4{pi} ampere-turn |
1 | {PSI} | [psi] | D;the 23rd letter of the greek alphabet -- symbol {PSI} or {psi} |
1 | {psi} | [psi] | D;the 23rd letter of the greek alphabet -- symbol {PSI} or {psi} |
1 | {RHO} | [rho] | D;the 17th letter of the Greek alphabet -- symbol {RHO} or {rho} |
4 | {rho} | [rho] | D;the 17th letter of the Greek alphabet -- symbol {RHO} or {rho} |
1 | {sigma-1} | [sigma] | D;the 18th letter ... -- symbol {SIGMA} or {sigma-1} or {sigma-2} |
1 | {sigma-2} | [sigma] | D;the 18th letter ... symbol {SIGMA} or {sigma-1} or {sigma-2} |
1 | {sigma} | [rough breathing] | D;a mark {rough-breathing-mark} used in Greek ... |
4 | {SIGMA} | [IHS] | E;LL, part transliteration of Gk {IOTA}{ETA}{SIGMA}, ... |
1 | {TAU} | [tau] | D;the 19th letter of the Greek alphabet -- symbol {TAU} or {tau} |
3 | {tau} | [rough breathing] | D;a mark ... {eta-accent}{tau}{omega}{rho} ... |
1 | {THETA} | [theta] | D;the 8th letter of the Greek alphabet -- symbol {THETA} or {theta} |
1 | {theta} | [theta] | D;the 8th letter of the Greek alphabet -- symbol {THETA} or {theta} |
2 | {upsilon} | [hyoid bone] | E; ... like the letter upsilon ({UPSILON}, {upsilon}), ... |
3 | {UPSILON} | [IHS] | E;LL, ... {ETA}{SIGMA}{OMICRON}{UPSILON}{SIGMA} ... |
1 | {XI} | [xi] | D;the fourteenth letter of the Greek alphabet -- symbol {XI} or {xi} |
2 | {xi} | [gamma nasal] | D;gamma pronounced \\{ng}\\, before {kappa}, {chi}, {xi}, ... |
1 | {ZETA} | [zeta] | D;the 6th letter of the Greek alphabet -- symbol {ZETA} or {zeta} |
1 | {zeta} | [zeta] | D;the 6th letter of the Greek alphabet -- symbol {ZETA} or {zeta} |
Frequency | Special Code | Example Word | Usage |
4 | {asterick} | [asterisk] | D;the character {asterick} used in printing ... |
15 | {by} | [Microcard] | D;... card approximately 3 in. {by} 5 in. ... |
3 | {center-dot} | [congruent] | D;... since 12 {minus} 2 {equals} 2 {center-dot} 5> |
1 | {check} | [check] | D;a mark typically {check} placed beside an item ... |
1 | {close-double-curly-quotes} | [quotation mark] | D;one of ... {close-double-curly-quotes} ... |
1 | {close-double-straight-quotes} | [quotation mark] | D;one of ... {close-double-straight-quotes} ... |
1 | {close-single-curly-quotes} | [quotation mark] | D;one of ... {close-single-curly-quotes} ... |
1 | {close-single-straight-quotes} | [quotation mark] | D;one of ... {close-single-straight-quotes} |
428 | {dagger} | [Abelian group] | E;Niels [italic Abel] {dagger}1829 Norw ... |
75 | {degrees} | [British thermal unit] | D;the quantity of heat ... 39.2{degrees}F |
9 | {diagonal} | [and{diagonal}or] | F;and{diagonal}or |
3 | {division-symbol} | [gilbert] | D;the cgs unit ... 10{division-symbol}4{pi} ... |
16 | {dollar-mark} | [c] | D;a sum of {dollar-mark}100 |
1 | {double-arrows} | [equation] | D;an expression ... of the sign {double-arrows} ... |
1 | {double-bar-dollar-mark} | [dollar mark] | D;a mark {double-bar-dollar-mark} or ... |
4 | {double-bond} | [oxime] | D;... {two-single-bonds}C{double-bond}NOH |
1 | {double-dagger} | [double dagger] | D;the character {double-dagger} used ... |
1 | {double-half-arrows} | [equation] | D;an expression ... {double-half-arrows} if ... |
1 | {double-hyphen} | [double hyphen] | D;a punctuation mark {double-hyphen} used ... |
33 | {equals} | [acrylonitrile] | D;a ... nitrile CH[sub 2]{equals}CHCN used ... |
19 | {exclamation-point} | [bobbery] | E;Hindi ... oh father{exclamation-point} |
1 | {fist} | [index] | D;a character {fist} used ... |
1 | {flat-sign} | [flat] | D;a character {flat-sign} on ... |
1 | {horizontal-brace} | [brace] | D;a mark ... {horizontal-brace} used ... |
6 | {inches} | [punchboard] | D;a small board usu. 6{inches}{by}6{inches} ... |
1 | {integral-sign} | [sign] | D;a character ... {integral-sign}) indicating ... |
1 | {inverted-question-mark} | [cooncan] | E;... fr. Sp {inverted-question-mark} ... |
1 | {left-brace} | [brace] | D;a mark {left-brace} or ... |
5 | {left-bracket} | [-ium] | D;positive ion ... {left-bracket} ... |
2 | {left-square-bracket} | [bracket] | D;one of a pair of marks {left-square-bracket} ... |
20 | {minus} | [Dry Ice] | D;-- used ... at {minus}78.5{degrees}C ... |
20 | {minutes} | [east by north] | D;a compass point ... N 78{degrees}45{minutes} E |
1 | {natural-sign} | [natural] | D;the character or sign {natural-sign} ... |
1 | {open-double-curly-quotes} | [quotation mark] | D;one of a pair ... {open-double-curly-quotes} ... |
1 | {open-double-straight-quotes} | [quotation mark] | D;one of a pair ... {open-double-straight-quotes} ... |
1 | {open-single-curly-quotes} | [quotation mark] | D;one of a pair ... {open-single-curly-quotes} ... |
1 | {open-single-straight-quotes} | [quotation mark] | D;one of a pair ... {open-single-straight-quotes} ... |
1 | {paragraph-mark} | [paragraph] | D;a character (as {paragraph-mark}) used ... |
1 | {parallel} | [parallel] | D;a character {parallel} used ... |
1 | {percent} | [locker] | D;a compartment ... at 80{percent} relative humidity |
2 | {plus-or-minus} | [functor] | D;an expression (as {plus-or-minus} or ... |
14 | {plus} | [alkyl] | D;... radical C[sub n]H[sub 2[italic n]{plus}1] |
1 | {prime} | [prime] | D;the symbol {prime} |
26 | {question-mark} | [anadiplosis] | D;... honor such as his{question-mark} ... |
1 | {radical-sign} | [radical sign] | D;the sign {radical-sign} ... |
1 | {radical} | [sign] | D;a character (as {division-symbol}, {radical}, or ...) |
1 | {right-arrow} | [equation] | D;an expression ... {right-arrow} or ... |
1 | {right-brace} | [brace] | D;a mark {left-brace} or {right-brace} or ... |
5 | {right-bracket} | [-ium] | D;positive ... H{right-bracket}[sup +]> |
2 | {right-square-bracket} | [bracket] | D;one of ... {right-square-bracket} ... |
2 | {seconds} | [obliquity] | D;... 23{degrees}26{minutes}40{seconds}.16 ... |
1 | {section-mark} | [section] | D;a character {section-mark} commonly used ... |
4 | {sharp} | [sharp] | D;a character {sharp} on a line or ... |
2 | {single-bond} | [enol] | D;... {two-single-bonds}C:C(OH){single-bond} |
1 | {space-mark} | [space mark] | D;the symbol {space-mark} |
1 | {spade} | [spade] | D;a black figure {spade} on each playing card ... |
1 | {squareroot-of-3} | [surd] | D;an irrational root (as {squareroot-of-3}) |
2 | {squareroot-of-minus-one} | [complex number] | D;a number ... [italic b{squareroot-of-minus-one}] ... |
3 | {stacked-commas} | [ivory] | E;... fr. Egypt {stacked-commas}[italic b], ... |
1 | {ten-to-ten-to-hundred} | [googolplex] | D;1 followed by ... to {ten-to-ten-to-hundred} |
8 | {times} | [curie] | D;a unit quantity ... 3.7 {times} 10[sup 10] ... |
1 | {triple-bond} | [acetylene] | D;a ... hydrocarbon Hc{triple-bond}ch ... |
8 | {two-single-bonds} | [acetal] | D;... the grouping {two-single-bonds}C(OR)[sub 2] ... |
1 | {underbar} | [heulandite] | D;a zeolite (Na,Ca)[sub 4{underbar}6]Al[sub 6]... |
Frequency | Special Code | Example Word | Usage |
1 | {aleph} | [aleph] | D;the 1st letter of the Hebrew alphabet -- symbol {aleph} |
1 | {ayin} | [ayin] | D;the 16th letter of the Hebrew alphabet -- symbol {ayin} |
1 | {beth} | [beth] | D;the 2d letter of the Hebrew alphabet -- symbol {beth} |
1 | {daleth} | [daleth] | D;the fourth letter of the Hebrew alphabet -- symbol {daleth} |
1 | {gimel} | [gimel] | D;the third letter of the Hebrew alphabet -- symbol {gimel} |
1 | {heth} | [heth] | D;the 8th letter of the Hebrew alphabet -- symbol {heth} |
1 | {he} | [he] | D;the 5th letter of the Hebrew alphabet -- symbol {he} |
1 | {kaph(final)} | [kaph] | D;the 11th letter of the Hebrew alphabet -- symbol {kaph} or {kaph(final)} |
1 | {kaph} | [kaph] | D;the 11th letter of the Hebrew alphabet -- symbol {kaph} or {kaph(final)} |
1 | {lamed} | [lamed] | D;the 12th letter of the Hebrew alphabet -- symbol {lamed} |
1 | {mem(final)} | [mem] | D;the 13th letter of the Hebrew alphabet -- symbol {mem} or {mem(final)} |
1 | {mem} | [mem] | D;the 13th letter of the Hebrew alphabet -- symbol {mem} or {mem(final)} |
1 | {nun(final)} | [nun] | D;the 14th letter of the Hebrew alphabet -- symbol {nun} or {nun(final)} |
1 | {nun} | [nun] | D;the 14th letter of the Hebrew alphabet -- symbol {nun} or {nun(final)} |
1 | {pe(final)} | [pe] | D;the 17th letter of the Hebrew alphabet -- symbol {pe} or {pe(final)} |
1 | {pe} | [pe] | D;the 17th letter of the Hebrew alphabet -- symbol {pe} or {pe(final)} |
1 | {qoph} | [qoph] | D;the 19th letter of the Hebrew alphabet -- symbol {qoph} |
1 | {resh} | [resh] | D;the 20th letter of the Hebrew alphabet -- symbol {resh} |
2 | {sadhe(final)} | [sadhe] | D;the 18th letter of the Hebrew alphabet -- symbol {sadhe} or {sadhe(final)} |
2 | {sadhe} | [sadhe] | D;the 18th letter of the Hebrew alphabet -- symbol {sadhe} or {sadhe(final)} |
1 | {samekh} | [samekh] | D;the 15th letter of the Hebrew alphabet -- symbol {samekh} |
1 | {shin} | [shin] | D;the 22nd letter of the Hebrew alphabet -- symbol {shin} |
1 | {sin} | [sin] | D;the 21st letter of the Hebrew alphabet -- symbol {sin} |
1 | {taw} | [taw] | D;the 23d letter of the Hebrew alphabet -- symbol {taw} |
1 | {teth} | [teth] | D;the 9th letter of the Hebrew alphabet -- symbol {teth} |
1 | {waw} | [waw] | D;the 6th letter of the Hebrew alphabet -- symbol {waw} |
1 | {yod} | [yod] | D;the 10th letter of the Hebrew alphabet -- symbol {yod} |
1 | {zayin} | [zayin] | D;the 7th letter of the Hebrew alphabet -- symbol {zayin} |
Frequency | Special Code | Example Word | Usage |
15 | {a-breve} | [Aaron] | E;LL, fr. Heb [italic Ah{a-breve}r{o-}n] |
4 | {A-breve} | [Amalekite] | E;... pl. fr. [italic '{A-breve}m{a-}l{e-}q] Amalek ... |
1 | {a-cidilla} | [thong] | E;... Av [italic thw{a-cidilla}z{j-hachek}aiti] ... |
1 | {a-hachek} | [Tuscarora] | E;Tuscarora [italic Sk{a-hachek}-r{u-}-r{e-hachek}n], ... |
8 | {a-macron-tilda} | [bandanna] | E;Hindi [italic b{a-macron-tilda}dhn{u-}] ... |
1 | {a-sub-dot} | [Hasid] | E;Heb [italic h{a-sub-dot}{a-}s{i-}dh] pious |
1 | {A-sup-circle} | [angstrom] | E;Anders J. [italic {A-sup-circle}ngstr{o:}m] |
7 | {a-sup-circle} | [desman] | E;short for Sw [italic desmansr{a-sup-circle}tta], ... |
2 | {accent-acute} | [accent] | D;a mark (as {accent-acute}, {accent-grave}, {circumflex}) ... |
3 | {accent-grave} | [accent] | D;a mark (as {accent-acute}, {accent-grave}, {circumflex}) ... |
1 | {alpha-smooth-breathing} | [smooth breathing] | D;a mark ... (as in {alpha-smooth-breathing}{gamma}... |
1 | {asterism-down} | [asterism] | D;three asterisks ... {asterism-down}) ... |
1 | {asterism-up} | [asterism] | D;three asterisks ... (as {asterism-up} ... |
1 | {b-minus-c-overbar} | [vinculum] | D;a ... mark ... (as in a {minus} {b-minus-c-overbar} ... |
1 | {b-sup-macron} | [Verner's law] | D;0;;;n;a statement ... fricatives {b-sup-macron}, ... |
7 | {breve} | [Adar Sheni] | E;Heb [italic A{breve}dh{a-}r Sh{e-}n{i-}] second Adar |
1 | {C-hachek} | [Czech] | E;Czech [italic {C-hachek}ech] |
7 | {c-hachek} | [Comanche] | E; perh. akin to Hopi [italic kom{a'}n{c-hachek}i] scalp lock |
1 | {c-with-vertical-bar} | [alla breve] | D;the sign {c-with-vertical-bar} marking ... |
3 | {circumflex} | [accent] | D;a mark (as ... {circumflex}) used ... |
1 | {curcumflex} | [circumflex] | D;a mark {circumflex}, {curcumflex}, or ... |
1 | {curly-A} | [swash] | D;... <...{curly-A} {curly-R} {curly-P} {curly-N}> |
1 | {curly-N} | [swash] | D;... <...{curly-A} {curly-R} {curly-P} {curly-N}> |
1 | {curly-P} | [swash] | D;... <...{curly-A} {curly-R} {curly-P} {curly-N}> |
1 | {curly-R} | [swash] | D;... <...{curly-A} {curly-R} {curly-P} {curly-N}> |
9 | {d-sub-dot} | [drub] | E;perh. fr. Ar [italic {d-sub-dot}araba] |
2 | {E-breve} | [Elisha] | E;Heb [italic {E-breve}l{i-}sh{a-}'] |
63 | {e-breve} | [Ashkenazi] | E;Heb [italic Ashk{e-breve}n{a-}z{i-}] |
5 | {e-cidilla} | [cinder] | E; akin ... OSlav [italic s{e-cidilla}dra] stalactite |
25 | {e-hachek} | [Belial] | E;Gk, fr. Heb b{e-hachek}l{i-}ya'al ... |
1 | {e-macron-tilda} | [Oneida] | E;Iroquois [italic On{e-macron-tilda}y{o`}de'], ... |
2 | {edh} | [Verner's law] | D;0;;;n;a ... {edh}, {g-sub-macron}, and [italic z] ... |
1 | {eta-accent} | [rough breathing] | D;a mark ... {eta-accent}{tau}{omega}{rho} ... |
1 | {fermata-down} | [fermata] | D; the sign {fermata-up} or {fermata-down} ... |
1 | {fermata-up} | [fermata] | D; the sign {fermata-up} or {fermata-down} ... |
2 | {g-hachek} | [shagreen] | E;by ... fr. Turk [italic sa{g-hachek}ri] |
1 | {g-sub-macron} | [Verner's law] | D;0;;;n;a ... {g-sub-macron}, and [italic z] represented ... |
2 | {h-sub-breve} | [ankh] | E;Egypt [italic 'n{h-sub-breve}] |
38 | {h-sub-dot} | [Apis] | E;L, fr. Gk, fr. Egypt [italic {h-sub-dot}p] |
3 | {H-sub-dot} | [Heshvan] | E;Heb [italic {H-sub-dot}eshw{a-}n] |
1 | {hachek} | [Judah] | E;Heb [italic Ye{hachek}h{u-}dh{a-}h] |
20 | {i-breve} | [beluga] | E;Russ [italic belukha], fr. [italic bely{i-breve}] |
3 | {i-hachek} | [hap] | E; akin ..., OSlav [italic kob{i-hachek}] augury |
1 | {i-macron-tilda} | [chintz] | E;... fr. Hindi [italic ch{i-macron-tilda}{t-sub-dot}] |
3 | {j-hachek} | [file] | E; akin to Arm [italic {j-hachek}il] sinew |
3 | {l-sub-dot} | [milo] | E;Sotho [italic mai{l-sub-dot}i] |
1 | {macron} | [macron] | D;a mark {macron} placed over a vowel ... |
2 | {n-breve} | [Vaishnava] | E;Skt [italic vai{s-breve}{n-breve}ava] ... |
28 | {n-sub-dot} | [Konkani] | E;Marathi [italic Ko{n.}ka{n-sub-dot}{i-}] |
1 | {o-breve} | [Naomi] | E;Heb [italic N{a-}'{o-breve}m{i-}] |
3 | {o-cidilla} | [permeate] | E; ... OSlav [italic min{o-cidilla}ti] to pass |
1 | {o-sub-dot} | [Romany] | E;Romany ..., fr. Skt [italic d{o-sub-dot}mba] ... |
1 | {omega-rough-breathing} | [rough breathing] | D;a mark ... (as in {omega-rough-breathing}... |
1 | {presa} | [presa] | D;a mark or cue (as :S: or {presa}) ... |
1 | {r-hachek} | [cricetid] | E; akin ... [italic k{r-hachek}e{c-hachek}ek] |
1 | {r-sub-diaeresis} | [coir] | E;Tamil [italic kayi{r-sub-diaeresis}u] rope |
21 | {r-sub-dot} | [Krishnaism] | E;..., Skt [italic K{r-sub-dot}{s-sub-dot}{n-sub-dot}a] |
1 | {rho-rough-breathing} | [rough breathing] | D;a mark ... {rho-rough-breathing}... |
2 | {rough-breathing-mark} | [rough breathing] | D;a mark {rough-breathing-mark} used ... |
2 | {s-breve} | [Vaishnava] | E;Skt [italic vai{s-breve}{n-breve}ava] of Vishnu, ... |
23 | {s-hachek} | [anacrusis] | E; akin to Lith [italic kru{s-hachek}ti] to stamp |
3 | {S-sub-dot} | [Zephaniah] | E;Heb [italic {S-sub-dot}{e-breve}phany{a-}h] |
57 | {s-sub-dot} | [Belshazzar] | E;Heb [italic B{e-}lsha{s-sub-dot}{s-sub-dot}ar] |
8 | {slash-o} | [Boyg] | E;Norw [italic b{slash-o}ig] bugbear |
1 | {slur-down} | [slur] | D;a curved line {slur-up} or {slur-down} ... |
1 | {slur-up} | [slur] | D;a curved line {slur-up} or {slur-down} ... |
2 | {smooth-breathing-mark} | [smooth breathing] | D;a mark {smooth-breathing-mark} placed over ... |
8 | {sub-dot} | [Ananias] | E;... fr. Heb [italic H{sub-dot}a{breve}nany{a-}h] |
65 | {t-sub-dot} | [Altair] | E;Ar [italic al-{t-sub-dot}{a-}'ir], lit., the flier |
1 | {t-sub-macron} | [niter] | E;... fr. Egypt [italic n{t-sub-macron}ry] |
5 | {thorn} | [Verner's law] | D;0;;;n;a statement ... {thorn}, and ... |
1 | {tilda} | [circumflex] | D;a mark {circumflex}, {curcumflex}, or {tilda} ... |
20 | {u-breve} | [apple] | E; ..., OSlav [italic abl{u-breve}ko] |
2 | {u-hachek} | [roof] | E; OSlav [italic strop{u-hachek}] roof |
2 | {umlaut} | [diaeresis] | D;a mark {umlaut} placed over a vowel ... |
1 | {y-breve} | [gum] | E;... fr. Egypt [italic qm{y-breve}t] |
2 | {yogh} | [yogh] | D;a letter {yogh} used ... |
9 | {z-hachek} | [character] | E; akin to Lith [italic {z-hachek}erti] to scratch |
1 | {z-sub-diaeresis} | [nizam] | E;Hindi [italic ni{z-sub-diaeresis}{a-}m] order, ... |
5 | {z-sub-dot} | [nadir] | E;ME, fr. MF, fr. Ar [italic na{z-sub-dot}{i-}r] ... |
And 37 fractions, which are of the form {a/b} where a and b are integers.
Frequency | Special Code | Example Word | Usage |
5 | {1/1000} | [gram] | D;a metric unit of mass and weight equal to {1/1000} kilogram ... |
3 | {1/100} | [cent] | D;a unit of value equal to {1/100} part of basic monetary unit ... |
4 | {1/10} | [blind] | D;having less than {1/10} of normal vision in the more efficient eye ... |
2 | {1/12} | [month] | D;a measure of time ... or {1/12} of a year |
1 | {1/14} | [agate line] | D;a space one column wide and {1/14} inch deep used ... |
4 | {1/16} | [anna] | D;a former monetary unit of Burma, ... equal to {1/16} rupee |
1 | {1/1837} | [electron] | D;an elementary particle ... or {1/1837} that of a proton |
1 | {1/192} | [pie] | D;a former monetary unit of India ... equal to {1/192} of the rupee |
7 | {1/20} | [fluidounce] | D;a British unit of liquid capacity equal to {1/20} pint |
1 | {1/22} | [piaster] | D;a former monetary unit of Saudi Arabia equal to {1/22} riyal |
1 | {1/24} | [karat] | D;a unit of fineness for gold equal to {1/24} part of pure gold in an alloy |
26 | {1/2} | [agate] | D;a size of type approximately 5{1/2} point |
1 | {1/34} | [maravedi] | D;a medieval Spanish copper coin unit worth {1/34} real |
1 | {1/360} | [simple interest] | D;interest paid ... on the assumption that each day is {1/360} of a year |
1 | {1/36} | [inch] | D;a unit of length equal to {1/36} yard |
9 | {1/3} | [LP] | D;-- used for ... turning at 33{1/3} revolutions per minute |
1 | {1/4000} | [para] | D;a Turkish monetary unit equal in modern Turkey to {1/4000} of a lira |
12 | {1/4} | [O gauge] | D;a gauge of track ... approximately 1{1/4} inches apart |
1 | {1/6400} | [mil] | D;a unit of angular measurement ... equal to {1/6400} of 360 degrees |
1 | {1/64} | [pice] | D;a former monetary unit ... Pakistan equal to {1/64} of the rupee |
2 | {1/6} | [obol] | D;an ancient Greek coin or weight equal to {1/6} drachma |
1 | {1/72} | [point] | D;a unit of about {1/72} inch used to measure the belly-to-back ... |
1 | {1/746} | [watt] | D;the absolute mks unit ... : {1/746} horsepower |
2 | {1/8} | [bit] | D;a unit of value equal to {1/8} of a dollar <four ~[italic s]> |
1 | {2/10} | [decimal] | D;a proper fraction ... (as .2 {equals} {2/10}, ...) |
2 | {2/3} | [mixed number] | D;a number (as [italic 5{2/3}] or [italic 7.238]) composed ... |
3 | {2/4} | [one-step] | D;a ballroom dance ... in {2/4} time |
1 | {2/5} | [magnum] | D;a large wine bottle holding about {2/5} of a gallon |
1 | {25/1000} | [decimal] | D;a proper fraction ... (... .25 {equals} {25/100}, ...) |
1 | {25/100} | [decimal] | D;a proper fraction ... (... .025 {equals} {25/1000}) |
1 | {3/2} | [reciprocal] | D;one of a pair of numbers (as {2/3}, {3/2}) whose product is one |
9 | {3/4} | [O O gauge] | D;a gauge of track ... approximately {3/4} inch apart |
1 | {3/8} | [mazurka] | D;music for the mazurka or ... in moderate {3/4} or {3/8} time |
3 | {4/4} | [gavotte] | D;a tune for the gavotte in moderately quick {4/4} time |
1 | {4/5} | [jeroboam] | D;a wine bottle holding about {4/5} gallon |
3 | {5/8} | [fraction] | D;a numerical representation (as {3/4}, {5/8}, 3.234) of two numbers ... |
3 | {6/8} | [march] | D;an instrumental ... triply compound rhythm (as {6/8} time) ... |
Home | Comments? |