Dataset
PubMed Co-Citation
Network Type
The PubMed co-citation network is a hypergraph where nodes are scientific papers from the PubMed database and hyperedges are sets of papers that are co-cited together by another paper. Each node is labeled with the paper's research topic, chosen from three diabetes-related categories: Diabetes Mellitus Experimental, Diabetes Mellitus Type 1, and Diabetes Mellitus Type 2. This dataset is commonly used as a benchmark in hypergraph learning tasks, particularly for biomedical document classification.
Dataset Statistics
Nodes
Nodes
19,717
Node Type
Scientific Paper
Node Label
Research Topic
Node Degree
Min1
Q14
Median7
Q312
Max99
Node Label Distribution
Hyperedges
Hyperedges
7,963
Hyperedge Type
Co-Citation Set
Hyperedge Degree
Min2
Q12
Median3
Q34
Max171