konect logo
KONECT
KONECT > Networks > CiteSeer

CiteSeer

About this network

This is the citation network extracted from the CiteSeer digital library. Nodes are publications and the directed edges denote citations. Publications can cite themselves in this dataset, and therefore the network includes loops. The network was extracted by hand by Jérôme Kunegis before the start of the KONECT project, and therefore no extraction code remains.

Network info

CodeCS
Category Citation
Data source http://citeseer.ist.psu.edu/oai.html
Vertex type Publication
Edge type Citation
FormatDirected: Edges are directed Directed
Edge weightsUnweighted: Simple edges Unweighted
Metadata Loops:  An edge may connect a node with itself Loop
Size384,413 vertices (publications)
Volume1,751,463 edges (citations)
Average degree (overall)9.1124 edges / vertex
Fill1.1852 10–5 edges / vertex2
Maximum degree1,739 edges
Reciprocity1.36%
Size of LCC365,154 vertices
Size of LSCC16,208 vertices
Wedge count81,727,579
Claw count9,032,698,085
Triangle count1,351,820
Square count25,382,500
4-tour count533,442,606
Power law exponent (estimated) with dmin2.7310 (dmin = 20)
Gini coefficient57.9%
Relative edge distribution entropy94.5%
Assortativity–0.061826
Clustering coefficient4.96%
Diameter34 edges
90-percentile effective diameter7.96 edges
Mean shortest path length6.35 edges
Spectral norm58.653
Algebraic connectivity0.0046051
Degree distribution of the CiteSeer network
Degree distribution
Outdegree distribution of the CiteSeer network
Outdegree distribution
Indegree distribution of the CiteSeer network
Indegree distribution
Degree distribution of the CiteSeer network
Degree distribution
Outdegree distribution of the CiteSeer network
Outdegree distribution
Indegree distribution of the CiteSeer network
Indegree distribution
Degree distribution of the CiteSeer network
Degree distribution
Outdegree distribution of the CiteSeer network
Outdegree distribution
Indegree distribution of the CiteSeer network
Indegree distribution
Clustering coefficient distribution of the CiteSeer network
Clustering coefficient distribution
Distance distribution of the CiteSeer network
Distance distribution
Distance distribution on a logistic scale of the CiteSeer network
Distance distribution on a logistic scale
Top-k eigenvalues of A of the CiteSeer network
Top-k eigenvalues of A
Top-k eigenvalues of N of the CiteSeer network
Top-k eigenvalues of N
Spectral distribution of the eigenvalues of A of the CiteSeer network
Spectral distribution of the eigenvalues of A
Spectral distribution of the eigenvalues of N of the CiteSeer network
Spectral distribution of the eigenvalues of N
Spectral distribution of the eigenvalues of L of the CiteSeer network
Spectral distribution of the eigenvalues of L
Cumulative spectral distribution of A of the CiteSeer network
Cumulative spectral distribution of A
Cumulative spectral distribution of N of the CiteSeer network
Cumulative spectral distribution of N
Cumulative spectral distribution of L of the CiteSeer network
Cumulative spectral distribution of L
Complex eigenvalues of the asymmetric adjacency matrix of the CiteSeer network
Complex eigenvalues of the asymmetric adjacency matrix

Downloads

TSV file:downloadciteseer.tar.bz2 (7.40 MiB)

References

[1] Citeseer network dataset -- KONECT, April 2017. [ http ]
[2] Kurt Bollacker, Steve Lawrence, and C. Lee Giles. CiteSeer: An autonomous Web agent for automatic retrieval and identification of interesting publications. In Proc. Int. Conf. on Autonomous Agents, pages 116--123, 1998.

BibTeX