Contains the complete, raw linguistic data used to create the Ethnologue. Files are in the standard tab-delimited format, which can be loaded into virtually any spreadsheet, database, or other data analysis tool. Note that this dataset does not include commentary, but rather, data fields with simple values that can be submitted to statistical analysis. Includes three files: Language Data (ISO codes, language families, statuses, country counts, centroid points, and more for 7,457 languages); Country Data (language counts, literacy rates, populations, diversity indexes, and more for 237 countries); and Language-in-Country Data (11,177 listings of data specific to a particular language within a particular country where it is used).
To gain access to this dataset please contact the Electronic Resources Unit @csa-er-management-list@nd.edu