KONECT

Reuters

About this network

This is the bipartite network of story–word inclusions in the Reuters news stories collected in the Reuters Corpus, Volume 1 (RCV1). Left nodes represent stories; right nodes represent words. An edge represents the inclusion of a word in a story.
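
To make the edge model concrete, here is a minimal sketch on a toy corpus. The three stories are hypothetical (not taken from RCV1), and the sketch assumes that each occurrence of a word in a story contributes one edge, so a repeated word yields parallel edges. Under that assumption, the edge count with multiplicity and the count of distinct story–word pairs differ.

```python
from collections import Counter

# Toy corpus (hypothetical data, not taken from RCV1).
stories = {
    "s1": "oil prices rise as oil demand grows",
    "s2": "bank raises rates",
    "s3": "oil bank merger",
}

# Assumption: one edge per word occurrence, so repeated words
# inside a story produce parallel (multiple) edges.
edges = [
    (story, word)
    for story, text in stories.items()
    for word in text.split()
]

multiplicity = Counter(edges)      # (story, word) -> number of parallel edges
volume = len(edges)                # edges counted with multiplicity
unique_volume = len(multiplicity)  # distinct story-word pairs

print("volume:", volume)
print("unique volume:", unique_volume)
print("multiplicity of (s1, oil):", multiplicity[("s1", "oil")])
```

Here the word "oil" occurs twice in story s1, so that pair counts twice toward the volume but only once toward the unique volume.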

Network info

Code: RE
Category: Text
Data source: http://trec.nist.gov/data/reuters/reuters.html
Vertex type: Story, word
Edge type: Inclusion
Format: Bipartite (edges connect two types of nodes)
Edge weights: Multiple unweighted (multiple edges are possible)
Metadata: Entity metadata (nodes are annotated with metadata)
Size: 1,065,176 = 781,265 + 283,911 vertices (stories + words)
Volume: 96,903,520 edges (inclusions)
Unique volume: 60,569,726 edges (inclusions)
Average degree (overall): 181.95 edges / vertex
Average story degree: 124.03 edges / vertex
Average word degree: 341.32 edges / vertex
Fill: 0.00027307 edges / vertex^2
Maximum degree: 345,056 edges
Wedge count: 1,546,388,153,215
Claw count: 6.4992574078209184 × 10^16
Power law exponent (estimated) with d_min: 2.5110 (d_min = 59)
Gini coefficient: 68.0%
Relative edge distribution entropy: 82.6%
Assortativity: -0.12469
Diameter: 6 edges
90-percentile effective diameter: 3.33 edges
Mean shortest path length: 2.69 edges
Spectral norm: 6502.1
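
The average degree and fill values above follow from the vertex and edge counts by simple arithmetic. The sketch below assumes the usual definitions (overall average degree = 2 × volume / vertices, per-side average = volume / vertices on that side, fill = unique volume / (stories × words)). It also assumes 781,265 stories and 283,911 words, which are the counts consistent with the reported averages. The function name `bipartite_stats` is illustrative only.

```python
def bipartite_stats(n_left, n_right, volume, unique_volume):
    """Return basic density statistics of a bipartite multigraph.

    Assumed definitions (not quoted from the source):
      overall average degree = 2 * volume / (n_left + n_right)
      left/right average     = volume / n_left, volume / n_right
      fill                   = unique_volume / (n_left * n_right)
    """
    return {
        "avg_degree": 2 * volume / (n_left + n_right),
        "avg_left_degree": volume / n_left,
        "avg_right_degree": volume / n_right,
        "fill": unique_volume / (n_left * n_right),
    }


# Assumed partition sizes: 781,265 stories and 283,911 words.
stats = bipartite_stats(
    n_left=781_265,
    n_right=283_911,
    volume=96_903_520,
    unique_volume=60_569_726,
)

for name, value in stats.items():
    print(f"{name}: {value:.8g}")
```

Rounded to the precision used in the table, these reproduce the reported average degrees and fill.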
Plots of the Reuters network:
Edge multiplicity distribution
Cumulative edge multiplicity distribution
Story degree distribution
Word degree distribution
Degree distribution
Distance distribution
Distance distribution on a logistic scale
Spectral distribution of the eigenvalues of A
Spectral distribution of the eigenvalues of N
Spectral distribution of the eigenvalues of L
Cumulative spectral distribution of A
Cumulative spectral distribution of N
Cumulative spectral distribution of L

Downloads

TSV file: reuters.tar.bz2 (284.44 MiB)
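
A minimal sketch of reading such an edge list follows. The file layout is an assumption, not something stated on this page: the sketch assumes whitespace-separated lines with a story ID in the first column and a word ID in the second, with comment lines starting with `%`. The sample data and the helper name `read_edges` are hypothetical; check the extracted files before relying on this layout.

```python
from collections import Counter


def read_edges(lines):
    """Yield (story_id, word_id) pairs from an iterable of text lines.

    Assumed layout: whitespace-separated columns, story ID first,
    word ID second; blank lines and lines starting with '%' are skipped.
    Any further columns are ignored.
    """
    for line in lines:
        line = line.strip()
        if not line or line.startswith("%"):
            continue
        fields = line.split()
        yield int(fields[0]), int(fields[1])


# Hypothetical sample in the assumed layout (not real data).
sample = """% hypothetical header line
1 1
1 1
1 2
2 2
""".splitlines()

# Parallel edges collapse into a multiplicity per story-word pair.
edge_counts = Counter(read_edges(sample))

# Story degree = number of incident edges, counted with multiplicity.
story_degree = Counter()
for (story, _word), count in edge_counts.items():
    story_degree[story] += count

print(dict(edge_counts))
print(dict(story_degree))
```

Because `read_edges` accepts any iterable of lines, an open file object can be passed in place of `sample`, which streams the data instead of loading it all into memory.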

References

[1] Reuters network dataset. KONECT, April 2017.
[2] David D. Lewis, Yiming Yang, Tony G. Rose, and Fan Li. RCV1: A new benchmark collection for text categorization research. J. Machine Learning Research, 5:361–397, 2004.
