konect logo
KONECT
KONECT > Networks > Reuters-21578

Reuters-21578

About this network

This is the bipartite network of article–word inclusions in documents that appeared on Reuters newswire in 1987. Left nodes represent articles and right nodes represent words. An edge represents an article–word inclusion.

Network info

CodeR2
Category Text
Data source http://www.daviddlewis.com/resources/testcollections/reuters21578/
Vertex type Article, word
Edge type Inclusion
FormatBipartite: Edges connect two types of nodes Bipartite
Edge weightsMultiple unweighted: Multiple edges are possible Multiple unweighted
Size81,791 = 60,234 + 21,557 vertices (articles + words)
Volume1,464,182 edges (inclusions)
Unique volume978,446 edges (inclusions)
Average degree (overall)48.616 edges / vertex
Average article degree67.921 edges / vertex
Average word degree37.857 edges / vertex
Fill0.0011735 edges / vertex2
Maximum degree19,044 edges
Wedge count821,566,836
Claw count1,978,784,882,823
Square count2,502,669,891
4-tour count23,309,665,440
Power law exponent (estimated) with dmin2.4010 (dmin = 60)
Gini coefficient75.4%
Relative edge distribution entropy85.7%
Assortativity–0.14787
Diameter7 edges
90-percentile effective diameter3.85 edges
Mean shortest path length3.45 edges
Spectral norm685.71
Algebraic connectivity0.21887
Edge multiplicity distribution of the Reuters-21578 network
Edge multiplicity distribution
Cumulative edge multiplicity distribution of the Reuters-21578 network
Cumulative edge multiplicity distribution
Article degree distribution of the Reuters-21578 network
Article degree distribution
Word degree distribution of the Reuters-21578 network
Word degree distribution
Article degree distribution of the Reuters-21578 network
Article degree distribution
Word degree distribution of the Reuters-21578 network
Word degree distribution
Left degree distribution of the Reuters-21578 network
Left degree distribution
Right degree distribution of the Reuters-21578 network
Right degree distribution
Degree distribution of the Reuters-21578 network
Degree distribution
Article degree distribution of the Reuters-21578 network
Article degree distribution
Word degree distribution of the Reuters-21578 network
Word degree distribution
Distance distribution of the Reuters-21578 network
Distance distribution
Distance distribution on a logistic scale of the Reuters-21578 network
Distance distribution on a logistic scale
Top-k eigenvalues of L of the Reuters-21578 network
Top-k eigenvalues of L
Spectral distribution of the eigenvalues of A of the Reuters-21578 network
Spectral distribution of the eigenvalues of A
Spectral distribution of the eigenvalues of N of the Reuters-21578 network
Spectral distribution of the eigenvalues of N
Spectral distribution of the eigenvalues of L of the Reuters-21578 network
Spectral distribution of the eigenvalues of L
Cumulative spectral distribution of A of the Reuters-21578 network
Cumulative spectral distribution of A
Cumulative spectral distribution of N of the Reuters-21578 network
Cumulative spectral distribution of N
Cumulative spectral distribution of L of the Reuters-21578 network
Cumulative spectral distribution of L

Downloads

TSV file:downloadgottron-reuters.tar.bz2 (3.13 MiB)

References

[1] Reuters-21578 network dataset -- KONECT, October 2016. [ http ]
[2] David D. Lewis, Yiming Yang, Tony G. Rose, and Fan Li. RCV1: A new benchmark collection for text categorization research. J. Machine Learning Research, 5:361--397, 2004.

BibTeX