Comparison of ASCII encodings of the International Phonetic Alphabet

From Wikipedia, the free encyclopedia
(Redirected from ASCII IPA)
Jump to navigation Jump to search

Template:SHORTDESC:

The International Phonetic Alphabet (IPA) consists of more than 100 letters and diacritics. Before Unicode became widely available, several ASCII-based encoding systems of the IPA were proposed. The alphabet went through a large revision at the Kiel Convention of 1989, and the vowel symbols again in 1993.[1] Systems devised before these revisions inevitably lack support for the additions they introduced.

Only language-neutral systems are discussed below because language-dependent ones (such as ARPABET) do not allow for a systematic comparison.

General information

[edit | edit source]
System Author(s) Created Last
updated
Note Ref
Branner (unnamed) David Prager Branner at the University of Washington 1994 ? [2]
Millar & Oasa (unnamed) J. Bruce Millar and Hiroaki Oasa at Australian National University 1981 1981 [3]
PHONASCII George D. Allen at Purdue University 1988 1988 Not a direct mapping of the IPA. Segments are separated by spaces, and diacritics by commas. [4]
Praat Paul Boersma and David Weenink at the University of Amsterdam 1992 2025 A subset of a larger set of backslash trigraphs also comprising international and mathematical characters. The symbols can be typed in ASCII but will be visualized as the normal Unicode glyphs. [5]
IPA (SIL) Keyboard SIL International 1994 2021 [6]
UCLA Phonological Segment Inventory Database (UPSID) Ian Maddieson at the University of California, Los Angeles 1984 ? Presented here is the scheme used for representing phonemes in the database of phonological inventories. Consequently, it is not designed for transcription of multiple segments and does not have symbols for values not found phonemically in the languages sampled. [7]
Usenet ASCII-IPA transcription Participants in sci.lang and alt.usage.english newsgroups (later maintained by Evan Kirshenbaum at HP Labs) 1991 2011 Also known variously as "ASCII-IPA", "Kirshenbaum", etc.[8] IETF language subtags register fonkirsh to identify text in this convention.[9] [10]
Worldbet James L. Hieronymus at AT&T Bell Laboratories 1994 1994 Segments are separated by spaces. [11]
X-SAMPA John C. Wells at University College London 1995 2000 IETF language subtags register fonxsamp to identify text in this convention.[9] [12]

Symbols

[edit | edit source]

Only the symbols in the latest IPA chart are included. The numbers in the leftmost column, according to which the symbols are sorted, are the IPA Numbers. Some of the IPA symbols to which a system lacks a corresponding symbol may still be represented in that system by use of a modifier (diacritic), but such combinations are not included unless the documentation explicitly assigns one for the value.

Coverage

[edit | edit source]
Scope Branner Millar & Oasa PHONASCII Praat SIL UPSID Usenet Worldbet X-SAMPA
Consonants (80) 79 (99%) 69 (86%) 67 (84%) 80 (100%) 80 (100%) 75 (94%) 73 (91%) 73 (91%) 79 (99%)
Vowels (29) 29 (100%) 27 (93%) 26 (90%) 29 (100%) 28 (97%) 28 (97%) 28 (97%) 26 (90%) 29 (100%)
Diacritics (35) 34 (97%) 15 (43%) 25 (71%) 35 (100%) 34 (97%) 12 (34%) 17 (49%) 25 (71%) 26 (74%)
Suprasegmentals (28) 28 (100%) 20 (71%) 21 (75%) 28 (100%) 28 (100%) 2 (7%) 4 (14%) 11 (39%) 28 (100%)
Total (172) 170 (99%) 131 (76%) 139 (81%) 172 (100%) 170 (99%) 117 (68%) 122 (71%) 135 (78%) 162 (94%)

See also

[edit | edit source]

Notes

[edit | edit source]
  1. ^ a b c d e f g h i j k l In Worldbet, these combinations are given as merely proposed for values "for which no machine-readable coding has yet been proposed".
  2. ^ The uvular approximant is represented by R in PHONASCII.
  3. ^ a b c L represents either a voiceless alveolar lateral fricative, a velar approximant, or a velarized alveolar lateral approximant in the Usenet IPA/ASCII transcription.
  4. ^ a b c! represents either an alveolar or palatal click in the Usenet IPA/ASCII transcription.
  5. ^ a b - represents either retracted or "velarized or pharyngealized" in Millar & Oasa's system.
  6. ^ a b ¿ and ¡ are not part of ASCII, but are nonetheless proposed as encoding advanced and retracted tongue root, respectively, in Worldbet.
  7. ^ . represents either raised or palatalized in Millar & Oasa's system.
  8. ^ a b * represents either non-syllabic or extra-short in Millar & Oasa's system.
  9. ^ )) representing a tie bar is placed after both segments, as in ts)), in Branner's system.

References

[edit | edit source]
  1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  3. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  4. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  5. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  6. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  7. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  8. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Page 74.
  9. ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  10. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  11. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  12. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
[edit | edit source]