Domestic and International Common Language Database (DICL)
The database contains index measures of linguistic similarity both domestically and internationally. The domestic measures capture linguistic similarities present among populations within a single country while the international indexes capture language similarities between two different countries. The 8 indices reflect three different aspects of language: common official languages, common native and acquired spoken languages, and linguistic proximity across different languages. This database has many uses, such as in models of bilateral flows—including FDI, migration, and international trade—as well as in regional or country level analyses.
Extensive and detailed coverage
- Bilateral indexes for 242 countries
- Based on 6,674 individual languages
Download the dataset
The database is available as a single comma-separated (csv) file (6.09MB). The first line of the file contains variable names.
DOWNLOAD LINK for the DICL database
Recommended citation
Update notes
- March 2024: Introduced 5 additional language indices to database; added coverage of several additional languages to existing indices; relabeled several existing indices; and removed one redundant index (CL), which can readily be constructed from remaining indices as 0.5*(CNL+LPN).
- March 2021: DICL first released.
Related research
Prior versions
Contact us
If you have any remaining questions, need to report an error, or make a suggestion, please contact us.