Dataset

Cora Co-Authorship

Network Type

The Cora co-authorship network is a hypergraph where nodes are authors from the Cora dataset and hyperedges are papers, representing sets of authors who collaborated on a publication. Each author is labeled with their primary research area, inferred from the majority research area of their publications. The research areas include Case-Based Reasoning, Genetic Algorithms, Neural Networks, Probabilistic Methods, Reinforcement Learning, Rule Learning, and Theory. This dataset is commonly used as a benchmark in hypergraph learning tasks for author classification and link prediction.

Dataset Statistics

Nodes

Nodes

1,072

Node Type

Author

Node Label

Research Area

Node Degree

Min1
Q12
Median2
Q34
Max28

Node Label Distribution

Hyperedges

Hyperedges

1,393

Hyperedge Type

Publication

Hyperedge Degree

Min2
Q12
Median2
Q33
Max23

Changelog

Revision 2
  • Update to format version 0.3.
  • Drop papers with only one author.