Dataset

CiteSeer Co-Citation

Download Dataset

Network Type

Overview

The CiteSeer co-citation network is a hypergraph where nodes are scientific papers from the CiteSeer dataset and hyperedges are sets of papers that are co-cited together by another paper. Each node is labeled with the paper's research field, chosen from six computer science topics: Agents, Artificial Intelligence (AI), Databases (DB), Information Retrieval (IR), Machine Learning (ML), and Human-Computer Interaction (HCI). This dataset is commonly used as a benchmark in hypergraph learning tasks.

Statistics

Nodes

Nodes

3,312

Node Type

Scientific Paper

Node Label

Research Field

Node Degree

Min1
Q11
Median1
Q32
Max88

Node Label Distribution

6 unique labels · imbalance degree: 1.27

Hyperedges

Hyperedges

1,079

Hyperedge Type

Co-Citation Set

Hyperedge Degree

Min2
Q12
Median2
Q34
Max26