KONECT

Unicode languages

About this network

This bipartite network records which languages are spoken in which countries. Nodes are countries and languages; each edge weight is the proportion (between zero and one) of a country's population that speaks the given language. To quote the Unicode data description: "The main goal is to provide approximate figures for the literate, functional population for each language in each territory: that is, the population that is able to read and write each language, and is comfortable enough to use it with computers."
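As a rough illustration of the structure described above, the sketch below reads a weighted bipartite edge list and counts degrees on each side. The file layout is an assumption, not something this page specifies: one edge per line as `country language weight`, whitespace-separated, with comment lines starting with `%`. The sample rows are made up, and `parse_edges` and `degrees` are hypothetical helper names.

```python
from collections import defaultdict

def parse_edges(lines):
    """Parse 'country language weight' rows (assumed layout); skip '%' comments."""
    edges = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("%"):
            continue
        left, right, weight = line.split()[:3]
        edges.append((int(left), int(right), float(weight)))
    return edges

def degrees(edges):
    """Count edges per country and per language; the two ID spaces are separate."""
    country_deg = defaultdict(int)
    language_deg = defaultdict(int)
    for country, language, _weight in edges:
        country_deg[country] += 1
        language_deg[language] += 1
    return country_deg, language_deg

# Made-up sample rows, not taken from the dataset.
sample = [
    "% hypothetical comment line",
    "1 1 0.9",
    "1 2 0.1",
    "2 2 0.0",  # zero weights are allowed, so the edge is kept
]

edges = parse_edges(sample)
country_deg, language_deg = degrees(edges)
print(len(edges), dict(country_deg), dict(language_deg))
```

Keeping the zero-weight row matters here: the metadata says edges may have a weight of zero, so filtering on `weight > 0` would change the edge count.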

Network info

Code: UL
Category: Feature
Date of origin: 2015
Data source: http://www.unicode.org/cldr/charts/25/supplemental/territory_language_information.html
Vertex type: Country, language
Edge type: Hosts
Format: Bipartite (edges connect two types of nodes)
Edge weights: Positive weights (positively weighted edges)
Metadata: Zero weights (edges may have a weight of zero); entity metadata (nodes are annotated with metadata)
Size: 868 = 254 + 614 vertices (countries + languages)
Volume: 1,255 edges (hosts)
Average degree (overall): 2.8917 edges / vertex
Average country degree: 4.9409 edges / vertex
Average language degree: 2.0440 edges / vertex
Fill: 0.0070917 edges / vertex²
Maximum degree: 141 edges
Wedge count: 21,977
Claw count: 521,909
Square count: 1,266
4-tour count: 86,712
Power law exponent (estimated): 2.3710 (dmin = 3)
Gini coefficient: 58.3%
Relative edge distribution entropy: 88.9%
Assortativity: -0.25144
Diameter: 8 edges
90-percentile effective diameter: 5.24 edges
Mean shortest path length: 4.08 edges
Spectral norm: 7.9343
Algebraic connectivity: 0.00026391
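The edge count and the average degrees listed above are linked by simple arithmetic: in a bipartite graph, each side's average degree is the edge count divided by the number of vertices on that side, and the overall average is twice the edge count divided by the total vertex count. The sketch below uses only figures quoted above to recover the implied vertex counts; `implied_counts` is a hypothetical helper name.

```python
# Figures quoted in the network info above.
EDGES = 1255
AVG_COUNTRY_DEGREE = 4.9409
AVG_LANGUAGE_DEGREE = 2.0440
AVG_DEGREE_OVERALL = 2.8917

def implied_counts(edges, avg_left, avg_right):
    """Recover both vertex counts of a bipartite graph from its average degrees."""
    return round(edges / avg_left), round(edges / avg_right)

countries, languages = implied_counts(EDGES, AVG_COUNTRY_DEGREE, AVG_LANGUAGE_DEGREE)
total = countries + languages

# Every edge contributes one endpoint to each side, hence the factor of two.
overall = 2 * EDGES / total

print(countries, languages, total, round(overall, 4))
```

Under these figures the counts come out as 254 countries and 614 languages, 868 vertices in total, which reproduces the listed overall average degree of 2.8917.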
[Plots of the Unicode languages network: country degree distribution; language degree distribution; left degree distribution; right degree distribution; degree distribution; distance distribution; distance distribution on a logistic scale; top-k eigenvalues of L; spectral distribution of the eigenvalues of A, N and L; cumulative spectral distribution of A, N and L; eigenvectors of A, A (bipartite) and L]

Layout

[Figure: layout of the Unicode languages network]

Downloads

TSV file: unicodelang.tar.bz2 (7.77 KiB)
Extraction code: unicodelang.tar.bz2 (25.50 KiB)

References

[1] Unicode languages network dataset -- KONECT, October 2016.
