The database contains index measures of linguistic similarity both domestically and internationally. The domestic measures capture linguistic similarities present among populations within a single country while the international indexes capture language similarities between two different countries. The indexes reflect three aspects of language: common official languages, common native languages, and linguistic proximity across languages. This database has many uses, such as in in models of bilateral flows—including FDI, migration, and international trade—as well as in regional or country level analyses.
Extensive and detailed coverage
- Bilateral indexes for 242 countries
- Based on 6,534 individual languages
Download the dataset
Data is in comma-separated (csv) format. The first line of each file contains variable names. The file is compressed (zipped) and has a size of about 365kb.
Learn more about the database
If you have any remaining questions, need to report an error, or make a suggestion, please contact us.