FEWL (the FLEWSY Encoded Word List) is a collection of
fairly common English words, encoded in a notation called FLEWSY
(Fonemic Latin-1 English Writing SYstem). FLEWSY has been
designed as an encoding of phonemic and morphological information about
English words which facilitates transliteration into many reformed
spelling systems. The vocabulary of FEWL is derived from the
12dicts project (see this
page). The FLEWSY notation is ultimately derived from my
spelling system MCM.
This page describes the fourth version of FEWL, dating
from February, 2005.
Here is a tiny fragment of the FEWL list:
ajè'nda
- agenda
E'j(ent - agent
E'j-O'ld - age-old
E'jë$ - ages
aglò·merE'Xon - agglomeration
Each line is composed of a FLEWSY encoding followed by the separator " - " and the
traditional spelling of the word. For some simple reform systems,
it may be possible to transform the FEWL list into a list of respelled
words using nothing more than a number of "Replace" commands in a good
text editor. For more complex systems, some programming will be
necessary to perform the necessary transformations.
The level of programming expertise required for this purpose is not
great. Anyone familiar with the use of "regular expressions" will
almost certainly be able to construct a suitable transformation
program. The level of effort required will of course depend on
the complexity of the target system. See this page for a discussion for programmers
of how to implement a FEWL converter, including a link to a .zip file
containing sample programs in Python.
FEWL is available for download here.
FLEWSY is an encoding system for English words, using the
Latin-1 character set. It encodes four kinds of
information:
FLEWSY is too complex to be of much use itself as a
spelling system. But it contains much information that is
relevant to common reformed spelling schemes. When FLEWSY is
transformed into another encoding, the information which is not
relevant to the target scheme can be simply ignored in the
transformation process.
The heart of FLEWSY is its encoding of English phonemes. It is easiest to present FLEWSY as a phonemic encoding, and then to present the remaining elements as embellishments of that encoding. Most examples below are encoded only in the phonemic part of FLEWSY for simplicity - their actual encodings in the FEWL list are necessarily more complicated.
Though FLEWSY's complexity makes it unsuitable for use as
a spelling system, I have devised a practical (if odd-looking) system FLOSS (Fonemic Latin-One Spelling System)
based rather closely on the phonemic parts of FLEWSY. Despite its
high degree of unfamiliarity and use of an extended mixed-case
character set, it is surprisingly readable, though not as phonemic as MCM, from which it was ultimately derived.
FLEWSY symbol |
English sound |
Example words |
b |
b |
bEb¥ [baby] |
C |
ch |
Cìken [chicken] |
d |
d |
dìd [did] |
D |
th (voiced) |
Dàt [that] |
f |
f |
flùf [fluff] |
g |
g |
gìg3l [giggle] |
G |
ng (soft) |
wìG [wing] |
h |
h |
hàp¥ [happy] |
j |
j |
jùj [judge] |
J |
zh (French j) |
vìJon [vision] |
k |
k |
kEk [cake] |
K |
kh (German ch) |
lOK [loch] |
l |
l |
lEbel [label] |
m |
m |
mòm [mom] |
n |
n |
nùn [none] |
p |
p |
pIp3l [people] |
r |
r |
rØr [roar] |
s |
s |
sìster [sister] |
t |
t |
tOtal [total] |
T |
th (unvoiced) |
Tìk [thick] |
v |
v |
rêvòlv [revolve] |
w |
w |
wèt [wet] |
X |
sh |
mìXon [mission] |
y |
y |
yès [yes] |
z |
z |
zìgzàg [zigzag] |
FLEWSY uses the following symbols for English vowels:
FLEWSY symbol |
English sound |
Example words |
a, e, i, o, u,
µ, 3 |
schwa (note 1) |
atèmpt [attempt] rìvet [rivet] dèvil [devil] lèmon [lemon] fØrum [forum] òkyµpY [occupy] bùb3l [bubble] |
â, ê |
short i (note 2) |
mànâj [manage] bêlIv [believe] |
ä, ë,
ï, ö, ü |
indistinct i (note 3) |
mènäs [menace] fØrëst [forest] lìmït [limit] rèkögnYz [recognize] mìnüt [minute] |
à |
short a (note 4) |
hàp¥ [happy] |
ã |
nasal a (French) |
dEnUmã [denouement] |
@ |
variable a (note 4) |
d@ns [dance] |
A |
broad a (note 5) |
fADer [father] |
è |
short e |
bèd [bed] |
E |
long a (European e) |
vEl [veil] |
H |
nasalization |
ùHhùH [uh-huh] |
ì |
short i |
sìk [sick] |
I |
stressed long e
(European i) |
grIf [grief] |
ò |
short o (note 5) |
hòt [hot] |
õ |
nasal o (French) |
kõsyèrJ
[concierge] |
O |
long o |
slO [slow] |
Ö |
variable o (note 6) |
lÖs [loss] |
Ø |
au/aw (broad o) (note 6) |
klØ [claw] |
Q |
oi/oy |
jQful [joyful] |
& |
stressed er (note 7) |
h&t [hurt] |
ù |
short u |
mùd [mud] |
U |
long u/oo |
rUd [rude] |
V |
short oo |
wVd [wood] |
W |
ou/ow |
plW [plow] |
Y |
long i |
flY [fly] |
¥ |
unstressed long e
(note 8) |
sìl¥ [silly] |
Notes:
All five unaccented vowel letters represent the schwa sound. The letter used in a particular word is taken from its traditional spelling. Additionally, the symbol µ represents the schwa sound where the sound results from destressing the U or V sound, and the symbol 3 represents the schwa when the English spelling shows no vowel (as with bubble or chasm).
The symbols â and ê are used for the unstressed short i sound when it is traditionally spelled with a or e. These symbols are used only when the sound is pronounced distinctly.
The symbolls ä, ë, ï, ö and ü are used for an unstressed, indistinct sound which varies from speaker to speaker between the short i sound and the schwa. This sound frequently occurs in suffixes and inflectional endings such as -ity, -ness, -age and -ed. As with the schwa representations, the letter used is derived from the corresponding traditional spelling.
The symbol @ is used for the
short a (/{/) sound in American English when the corresponding British
English pronunciation is /A:/. In all other cases, the symbol
à is used for this sound.
The symbols A and ò represent the same sound in American English. The ò indicates that the British pronunciation of the word uses the British short o sound (Sampa /Q/), and A that the British pronunciation uses the American sound. If the British pronunciation is neither /Q/ nor /A:/, the symbol corresponding to the standard spelling is used. A British version of FLEWSY should use ò exclusively for the British short o.
The symbols Ö and
Ø represent the same sound in American English. The Ö
indicates that the British pronunciation of the word uses the British
short o sound (Sampa /Q/), and Ø that the British
pronunciation uses some other sound.
The symbol & represents a stressed er sound (Sampa /3`/), as in herd or eternal. An unstressed er sound (Sampa /@r/) is represented as a schwa followed by r, as in drYver [driver] or arìTmetìk [arithmetic].
The symbol ¥ is used to indicate the shorter less distinct form of long e which frequently occurs at the end of words and before other vowels, and which is commonly spelled y. This vowel is often represented in Sampa as /i/, as opposed to the full long e sound represented as /i:/.
FLEWSY uses the following symbols for vowel sounds used
before the letter r. In American English, these sounds are
generally merged with one of the vowel sounds above. By
permitting these sounds to be spelled differently depending on how they
are spoken in RP, FEWL can better support systems which attempt to
handle British as well as American English.
FLEWSY symbol |
RP equivalent (Sampa) |
Corresponding American sound (FLEWSY) |
Example words |
Ër |
/e@(r)/ |
èr (notes 1, 2) |
fËr¥ (fairy), bèr¥ (berry) |
Ïr |
/I@(r)/ |
ìr |
sÏr¥us (serious), pìr@mìd (pyramid) |
úr |
/Vr/ |
& |
húr¥ (hurry), bl&¥ (blurry) |
Ür |
/Ur/ |
Vr (note 3) |
kÜr¥er (courier), kyVr¥O (curio) |
Notes:
Some American dialects have a similar sound. The word "Mary", if pronounced differently from "marry" (màr¥) and "merry" (mèr¥) is the standard example.
In many words ending with -ary,
such as "secretary", the American pronunciation is a stressed /Eri/,
while the British pronunciation is an unstressed /@ri/. Because
this pronunciation is compatible with the -ary spelling, such words are
rendered with Ër¥
rather than èr¥.
Ordinarily, the American Vr
spelling is pronounced /U@(r)/ in RP. The Ür spelling is used for
the unusual cases where the /Ur/ pronunciation is also used in American
English.
A few FLEWSY characters indicate optional pronunciation elements, or sounds which depend on context. These are as follows:
FLEWSY uses two symbols to indicate stress: ' and ·
. The apostrophe indicates primary stress, and the raised dot
represents secondary stress. More than one syllable may be shown
with primary stress, especially in compound words. The stress
marker always follows a vowel. Examples: kà't [cat],
Ø·rganïzE'Xon
[organization],
sè'kondhà'nd
[secondhand]. A small number of words,
which are normally pronounced unstressed, are shown with no stress: ov [of], De [the], tµ [to].
FLEWSY indicates, in a somewhat imperfect way, when words
are combined or extended through the use of affixes or
compounding. The following symbols and conventions are used for
this purpose:
Note that all of these symbols (other than the colon and
the quote mark) are used in a quite
imprecise way - while they may provide useful guidance to whether a
word is related to other words, they must not be taken too
seriously. The main value of these notations is that they allow
systems which vary the representation of sounds by position to take
into account connections between words that are not obvious. For
instance, a system which represents vowel sounds differently at the end
of a word might want to treat the a in sofabed as if it ended the word,
but this is hard for a converter to handle automatically unless the
compounding is noted in the
FLEWSY representation.
The FEWL list was derived from the TARDRE
dictionary for my spelling system DRE.
The pronunciations listed in FEWL reflect what I call CLAEP, Consensus
Lexicographic American English Pronunciation. For each word, the
pronunciation shown was determined from the first listed pronunciation
in each of three dictionaries, resolved by majority vote. The
three dictionaries are the CD-ROM version of The Merriam-Webster
Collegiate Dictionary 11th edition, the fourth edition American
Heritage Dictionary
CD ROM, and the Longman Pronunciation Dictionary. In the event
these sources did not agree, the Random House Unabridged Dictionary CD
ROM was
used as a tiebreaker. The listed pronunciations often do not
reflect my own. Some words have two or three different
pronunciations, depending on part of speech or meaning, in which case
one pronunciation for each distinct form is listed, and an indication
of which form it applies to follows the traditional spelling in
parentheses.
It is worth noting how a few specific notations were
determined:
For various reasons, a few words in FEWL are shown with a pronunciation which differs from the CLAEP pronunciation. These "signature words" are listed and discussed below.
The FEWL word list was constructed in a number of steps, each step adding more detail to the previous step. The results of the earlier steps must be considered to be more reliable than the later steps, since they have been more thoroughly checked. The steps were as follows:
Because it was compiled so late in the process, and
because the criteria used were rather subjective, the information on
compound words and suffixes must be considered considerably less
reliable than the other parts of FEWL.
The concept of signature words was originally introduced
for commericial word lists (as for spelling checkers) to indicate
unusual words added to the list, allowing unauthorized copies of the
list to be recognized. This concept has been broadened to
indicate words on a list which technically do not belong. For
instance, the ENABLE word list (see here)
is a large non-commercial list of words
recognized as
valid plays in Scrabble. The ENABLE signature words are a small
set of perfectly valid words which are not however permitted in
Scrabble tournaments. (Scrabble is a trademark of Hasbro, Inc.)
The FEWL signature words are words which are listed with a
pronunciation other than their CLAEP pronunciation. Most of these
words were given a less common pronunciation to improve their
representation in DRE, for instance, to allow homonyms to be
distinguished, or to increase the regularity of the DRE spellings.
Very demanding users of FEWL might wish to reference the
actual CLAEP-determined pronunciation - for this reason, the FEWL
signature words and the way they "should" have been represented in FEWL
are listed below:
à'ber(ant - aberrant [should be
àbè'r(ant]
A'rkEì·z3m - archaism [should be
A'rk¥i·z3m]
ArC/dY'osIs - archdiocese [should be
ArC/DY'osës]
A'nt -
aunt [should be @'nt]
bolO'nya - bologna [should be bolO'n¥]
bØ'rO -
borrow [should be bò'rO]
bØ'rO\(er - borrower [should be
bò'rO\(er]
kolè'sterØ·l - cholesterol [should be
kolè'sterO·l]
kò'mptrO·l{er - comptroller [should be
kontrO'l{er]
tsA'r -
czar [should be zA'r]
tsAr\I'na - czarina [should be zAr\I'na]
dY'osIs - diocese [should be dY'osës]
dØ'gØ'n - doggone [should be
dò'gò'n]
fè'brUË·r¥ - February [should be
fè'by?UË·r¥]
hà·lOI'n - Halloween [should be
hà·lowI'n]
hù'H -
huh [should be hu', or so they say]
ì·/maCV'r - immature [should be
ì·/maty?V'r]
ì·/maCV'r\l¥ - immaturely
[should be ì·/maty?V'r\l¥]
ì·/maCV'r\(ït¥ - immaturity [should
be ì·/maty?V'r\(ït¥]
ì·n/dêpè'nd\(ens -
independence [should be ì·n/dëpè'nd\(ens]
ì·n/dêpè'nd\(ent - independent
[should be ì·n/dëpè'nd\(ent]
ì·n/dêpè'nd\(ent\l¥ -
independently [should be ì·n/dëpè'nd\(ent\l¥]
jè'zy?U·ït - Jesuit [should be
jè'JU·ït]
jU'dEì·z3m - Judaism [should be
jU'd¥ì·z3m]
kimO'nO
-
kimono [should be kimO'no]
lI'Ezò·n - liaison [should be
lI'azò·n]
lyA'ma -
llama [should be lA'ma]
maCV'r
- mature [should be maty?V'r]
maCV'r\l¥ - maturely [should be maty?V'r\l¥]
maCV'r\(ït¥ - maturity [should be maty?V'r\(ït¥]
mIlyU' -
milieu [should be milyV']
nAI'v -
naive [should be nYI'v]
nAI'v\l¥ - naively [should be nYI'v\l¥]
nAI·v\tE' - naivete [should be
nYI·v\tE']
nè'sësË·rï}l¥ -
necessarily [should be
nè'sësè·rï}l¥]
nè'sësË·r¥ - necessary [should
be nè'sësè·r¥]
prI·maCV'r - premature [should be
prI·maty?V'r]
prI·maCV'r\l¥ - prematurely [should be
prI·maty?V'r\l¥]
prYmË'rï}l¥ - primarily [should
be prYmè'rï}l¥]
rE'X¥O· - ratio [should be rE'XO]
rêkò'nasans - reconnaissance [should be
rêkò'nazans]
sà'karI·n - saccharine [should be
sà'ka?rïn]
sèrE't\ëþ - serrated [should be
serE't\ëþ]
sØ'rO -
sorrow [should be sò'rO]
sØ'rO\ful - sorrowful [should be
sò'rO\ful]
sØ'rO\fu?l¥ - sorrowfully [should
be
sò'rO\fu?l¥]
sØ'r¥ - sorry [should be
sò'r¥]
stA'nC
-
stanch [should be stØ'nC]
T&'z/dE -
Thursday [should be T&'zd¥]
tµmØ'rO - tomorrow [should be
tµmò'rO]
trØmà'\tìk - traumatic [should be
tramà'}tìk]
ty?U'z/dE -
Tuesday [should be ty?U'zd¥]
ù·n/àksè'pt\(ab3l
- unacceptable [should be ù·n/aksè'pt\(ab3l]
ù·n/àksè'pt\(abl¥ -
unacceptably [should be ù·n/aksè'pt\(abl¥]
ùn/kù'mfort\{ab3l - uncomfortable [should be
ùn/kù'mft{ab3l]
ùn/kù'mfort\{abl¥ -
uncomfortably [should be ùn/kù'mft{abl¥]
ù·n/wò'Xþ - unwashed [should be
ù·n/wØ'Xþ]
v&'tëbrI· - vertebrae [should be
v&'tëbrE·]
vò·luntË'rï}l¥ -
voluntarily [should be
vò·luntè'rï}l¥]
wÏ'r/wV·lf - werewolf [should be
wË'r/wV·lf]
wÏ'r/wV·lv$ - werewolves [should
be wË'r/wV·lv$]
*wùt=è'ver - whatever [should be
*wòt=è'ver]
*wì't/sù'n/dE - Whitsunday [should be
*wì't/sù'nd¥]
yè' -
yeah [should be yè'a]
For some practical recommendations on converting FLEWSY to your own notation, please proceed to this page.
To comment on this page,
e-mail Alan at wyrdplay.org
Go to wyrdplay.org home
page
Go to wyrdplay.org spelling
system roster