Dataset

CiteSeer Co-Citation

Network Type

The CiteSeer co-citation network is a hypergraph where nodes are scientific papers from the CiteSeer dataset and hyperedges are sets of papers that are co-cited together by another paper. Each node is labeled with the paper's research field, chosen from six computer science topics: Agents, Artificial Intelligence (AI), Databases (DB), Information Retrieval (IR), Machine Learning (ML), and Human-Computer Interaction (HCI). This dataset is commonly used as a benchmark in hypergraph learning tasks.

Dataset Statistics

Nodes

Nodes

3,312

Node Type

Scientific Paper

Node Label

Research Field

Node Degree

Min1
Q11
Median1
Q32
Max88

Node Label Distribution

Hyperedges

Hyperedges

1,079

Hyperedge Type

Co-Citation Set

Hyperedge Degree

Min2
Q12
Median2
Q34
Max26