## About this network

This is the bipartite network of story–word inclusions in Reuters news stories collected in the Reuters Corpus, Volume 1 (RCV1). Left nodes represent stories; right nodes represent words. An edge represents a story–word inclusion.

## Network info

Code | RE |

Category | Text |

Data source | http://trec.nist.gov/data/reuters/reuters.html |

Vertex type | Story, word |

Edge type | Inclusion |

Format | Bipartite |

Edge weights | Multiple unweighted |

Metadata | Entity |

Size | 1,065,176 = 781,265 + 283,911 vertices (stories + words) |

Volume | 96,903,520 edges (inclusions) |

Unique volume | 60,569,726 edges (inclusions) |

Average degree (overall) | 181.95 edges / vertex |

Average story degree | 124.03 edges / vertex |

Average word degree | 341.32 edges / vertex |

Fill | 0.00027307 edges / vertex^{2} |

Maximum degree | 345,056 edges |

Wedge count | 1,546,388,153,215 |

Claw count | 6.4992574078209184 × 10^{16} |

Power law exponent (estimated) with d_{min} | 2.5110 (d_{min} = 59) |

Gini coefficient | 68.0% |

Relative edge distribution entropy | 82.6% |

Assortativity | -0.12469 |

Diameter | 6 edges |

90-percentile effective diameter | 3.33 edges |

Mean shortest path length | 2.69 edges |

Spectral norm | 6502.1 |
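
The averages and the fill listed above can be reproduced from the vertex and edge counts. The sketch below is a quick consistency check, not the procedure used to generate the table. It assumes 781,265 stories and 283,911 words (1,065,176 vertices in total), that the overall average degree counts each edge at both of its endpoints, and that the fill is the unique volume divided by the number of possible story–word pairs. All variable names are illustrative.

```python
# Counts taken from the table. The story/word split is an assumption,
# chosen because it reproduces the listed average degrees.
n_stories = 781_265
n_words = 283_911
volume = 96_903_520         # edges, counting repeated inclusions
unique_volume = 60_569_726  # distinct story-word pairs

n_total = n_stories + n_words

# Every edge contributes to the degree of one story and one word.
avg_overall = 2 * volume / n_total
avg_story = volume / n_stories
avg_word = volume / n_words

# Fraction of all possible story-word pairs that are actually present.
fill = unique_volume / (n_stories * n_words)

print(f"overall: {avg_overall:.2f}")
print(f"story:   {avg_story:.2f}")
print(f"word:    {avg_word:.2f}")
print(f"fill:    {fill:.8f}")
```

Under these assumptions the four values round to the figures in the table (181.95, 124.03, 341.32 and 0.00027307).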

## Downloads

TSV file: | reuters.tar.bz2 (284.44 MiB) |
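
As a rough illustration of how the edge list could be read, here is a minimal sketch. It rests on several assumptions that are not stated on this page: that the archive contains a whitespace-separated edge file (the member name `reuters/out.reuters` is hypothetical), that lines starting with `%` are comments, that the first two columns are the story ID and the word ID, and that an optional third column gives the edge multiplicity. The function names are illustrative.

```python
import io
import tarfile


def open_member(archive_path, member="reuters/out.reuters"):
    """Open one text file inside the .tar.bz2 archive.

    The default member name is a guess, not taken from this page.
    """
    tar = tarfile.open(archive_path, "r:bz2")
    return io.TextIOWrapper(tar.extractfile(member), encoding="utf-8")


def read_edges(lines):
    """Yield (story_id, word_id, multiplicity) for each data line."""
    for line in lines:
        line = line.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and assumed comment lines
        fields = line.split()
        story, word = int(fields[0]), int(fields[1])
        # Assumption: a third column, if present, is the multiplicity.
        mult = int(fields[2]) if len(fields) > 2 else 1
        yield story, word, mult


def summarize(lines):
    """Return (volume, unique_volume) for an iterable of lines."""
    volume = 0
    pairs = set()
    for story, word, mult in read_edges(lines):
        volume += mult
        pairs.add((story, word))
    return volume, len(pairs)


# Tiny made-up sample: the pair (1, 1) appears twice and the pair
# (2, 2) carries a multiplicity of 3.
sample = "% hypothetical header line\n1 1\n1 1\n1 2\n2 2 3\n"
volume, unique_volume = summarize(io.StringIO(sample))
print(volume, unique_volume)
```

On the sample this gives a volume of 6 and a unique volume of 3, mirroring the distinction between "Volume" and "Unique volume" in the table. For the full dataset, holding about 60 million pairs in a Python `set` would need a lot of memory, so a streaming or array-based approach is likely to be more practical.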