Moby Project

The Moby Project is a collection of public-domain lexical resources created by Grady Ward.

The resources were dedicated to the public domain, and are now mirrored at Project Gutenberg.

Moby Part-of-Speech contains 233,356 words fully described by part(s) of speech, listed in priority order.

Each line is ended with the ASCII carriage return character (CR, '\r', 0x0D, 13 in decimal).

Non-English words are generally rendered, as stated in the documentation, without accents or other diacritical marks.

However, in 36 entries (e.g. São_Miguel), some non-ASCII accented characters remain, represented using Mac OS Roman encoding.

The following table contains these extra phonemes, but note that the extent to which some of these may exist due to encoding errors is not clear.