The UPSID
folder contains data from the UCLA Phonological Segment
Inventory Database:
Maddieson, I., & Precoda, K. (1990). Updating UPSID. UCLA Working Papers in Phonetics, 74, 104–111.
The contents and extraction pipeline for these data are described in (chapter 4):
Moran, Steven. (2012). Phonetics Information Base and Lexicon. PhD thesis, University of Washington. Online: https://digital.lib.washington.edu/researchworks/handle/1773/22452.
The data are available in several files in this directory from the original ASCII dump. These inventories contain only phonemes, with no information on allophones or linguistic tone.
We have converted IPA symbols in the raw data in line with the phoible conventions and Unicode IPA as described in the UPSID_IPA_correspondences.tsv file.
Note that Henning Reetz has put online a simple user interface to the UPSID data, which can be used for browsing and quick queries.