**CiteSeer**

# CiteSeer

## About this network

This is the citation network extracted from the CiteSeer digital library. Nodes are publications and the directed edges denote citations. Publications can cite themselves in this dataset, and therefore the network includes loops. The network was extracted by hand by Jérôme Kunegis before the start of the KONECT project, and therefore no extraction code remains.

## Network info

Code | CS |

Category | ⬤ Citation |

Data source | http://citeseer.ist.psu.edu/oai.html |

Vertex type | Publication |

Edge type | Citation |

Format | Directed |

Edge weights | Unweighted |

Metadata | Loop |

Size | 384,413 vertices (publications) |

Volume | 1,751,463 edges (citations) |

Average degree (overall) | 9.1124 edges / vertex |

Fill | 1.1852 × 10^{–5} edges / vertex^{2} |

Maximum degree | 1,739 edges |

Reciprocity | 1.36% |

Size of LCC | 365,154 vertices |

Size of LSCC | 16,208 vertices |

Wedge count | 81,727,579 |

Claw count | 9,032,698,085 |

Triangle count | 1,351,820 |

Square count | 25,382,500 |

4-tour count | 533,442,606 |

Power law exponent (estimated) with d_{min} | 2.7310 (d_{min} = 20) |

Gini coefficient | 57.9% |

Relative edge distribution entropy | 94.5% |

Assortativity | –0.061826 |

Clustering coefficient | 4.96% |

Diameter | 34 edges |

90-percentile effective diameter | 7.96 edges |

Mean shortest path length | 6.35 edges |

Spectral norm | 58.653 |

Algebraic connectivity | 0.0046051 |

## Downloads

TSV file: | citeseer.tar.bz2 (7.40 MiB) |