Dataset
CiteSeer Co-Citation
Network Type
Overview
The CiteSeer co-citation network is a hypergraph in which nodes are scientific papers from the CiteSeer dataset and each hyperedge is a set of papers cited together by the same paper. Each node is labeled with the paper's research field, one of six computer science topics: Agents, Artificial Intelligence (AI), Databases (DB), Information Retrieval (IR), Machine Learning (ML), and Human-Computer Interaction (HCI). This dataset is commonly used as a benchmark for hypergraph learning tasks.
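As a rough illustration of the construction described above, the sketch below builds co-citation hyperedges from a mapping of citing papers to their reference lists. The toy data, the paper identifiers, and the helper name `build_cocitation_hyperedges` are all hypothetical, and the rule of dropping reference lists with fewer than two papers is an assumption rather than the documented preprocessing for this dataset.

```python
# Hypothetical toy data: citing paper -> papers it cites.
citations = {
    "p_a": ["p1", "p2", "p3"],
    "p_b": ["p2", "p4"],
    "p_c": ["p5"],  # cites a single paper, so it yields no co-citation set
}

def build_cocitation_hyperedges(citations):
    """Return one hyperedge (a frozenset of cited papers) per citing paper."""
    hyperedges = []
    for cited in citations.values():
        edge = frozenset(cited)
        # Assumption: a co-citation set needs at least two papers.
        if len(edge) >= 2:
            hyperedges.append(edge)
    return hyperedges

hyperedges = build_cocitation_hyperedges(citations)

# Assumption: the node set is every paper that appears in some hyperedge.
nodes = sorted(set().union(*hyperedges))
```

In this toy example, `p5` never appears in a hyperedge and is therefore left out of the node set; whether the published dataset treats such papers the same way is not stated here.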
Statistics
Nodes
Nodes
3,312
Node Type
Scientific Paper
Node Label
Research Field
Node Degree
Min: 1
Q1: 1
Median: 1
Q3: 2
Max: 88
Node Label Distribution
6 unique labels · imbalance degree: 1.27
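A node degree summary of this kind could be computed along the following lines, reading node degree as the number of hyperedges that contain a node. This is a minimal sketch on hypothetical toy hyperedges; the quantile convention behind the figures above is not stated, so the `"inclusive"` method used here is an assumption and other conventions may give slightly different quartiles.

```python
from collections import Counter
from statistics import quantiles

# Hypothetical toy hyperedges (the real dataset has far more).
hyperedges = [{"p1", "p2", "p3"}, {"p2", "p4"}, {"p2", "p3"}]

# Node degree = number of hyperedges that contain the node.
node_degree = Counter(node for edge in hyperedges for node in edge)

degrees = sorted(node_degree.values())

# Assumption: quartiles via the "inclusive" method of statistics.quantiles.
q1, median, q3 = quantiles(degrees, n=4, method="inclusive")

summary = {
    "min": degrees[0],
    "q1": q1,
    "median": median,
    "q3": q3,
    "max": degrees[-1],
}
```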
Hyperedges
Hyperedges
1,079
Hyperedge Type
Co-Citation Set
Hyperedge Degree
Min: 2
Q1: 2
Median: 2
Q3: 4
Max: 26
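Node and hyperedge degrees can both be read off an incidence matrix, which is a common input format for hypergraph learning models. The sketch below assumes that hyperedge degree means the number of papers in a co-citation set (consistent with the minimum of 2 above) and uses hypothetical toy hyperedges; the names `H`, `node_degree`, and `hyperedge_degree` are illustrative only.

```python
# Hypothetical toy hyperedges over four papers.
hyperedges = [{"p1", "p2", "p3"}, {"p2", "p4"}, {"p2", "p3"}]
nodes = sorted(set().union(*hyperedges))

# Incidence matrix H: H[i][j] = 1 if node i belongs to hyperedge j, else 0.
H = [[1 if node in edge else 0 for edge in hyperedges] for node in nodes]

# Column sums give hyperedge degrees (papers per co-citation set).
hyperedge_degree = [sum(col) for col in zip(*H)]

# Row sums give node degrees (hyperedges per paper).
node_degree = [sum(row) for row in H]
```

Both degree sequences sum to the number of nonzero entries in `H`, which is a quick consistency check when loading the data.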