The Association Between Early ArXiv Posting and Citations

AAAI     3726
NIPS 3393
IJCAI 3001
WWW 2958
ACL 2676
ICML 2200
KDD 1661
ECCV 1477
EMNLP 1248
SODA 1234
HLT-NAACL 876
CVPR 467
FOCS 305
INFOCOM 183
ICRA 182
ICCV 156
  • All citations (cites_1year): the number of times the paper in question was cited by any other paper published during the calendar year following the conference.
  • Influential citations (influential_cites_1year): similar to cites_1year but captures a smaller subset of citations which are more likely to indicate that the paper in question is critical for the citing paper. It only counts non-self citations (i.e., with no overlap in the author lists) where the paper of interest is referenced three times or more in the narrative of the citing paper, not always combined with other references, mentioned in context of experimental results, or explicitly mentioned as foundation for the citing paper.
next_year_jan_1 = datetime(year=conf_year + 1, month=1, day=1).date()delta = next_year_jan_1 — arxiv_submission_datefrac_year_remaining = np.maximum(delta.days / 365, 0)
(arxiv_submission_date — conference_deadline_date).days
  • cites_1year — number of papers that cited p and were published in the calendar year following the official publication of p (continuous).
  • influential_cites_1year — number of influential papers that cited p and were published in the calendar year following the official publication of p (continuous).
  • max_hindex_decile — the decile into which the maximum (across all authors) h-index of p falls into (categorical — 10 levels).
  • submitted_before_deadline— whether p was submitted before the conference deadline plus 28 days (binary).
  • frac_year_remaining— fraction of year remaining from arXiv submission date until the year after the conference in which paper p was published (continuous).
  • conf — the conference where p was published (categorical — 16 levels).
cites_1year ~ max_hindex_decile + frac_year_remaining + conf
cites_1year ~ max_hindex_decile + frac_year_remaining + conf + submitted_before_deadline

--

--

--

Senior Applied Research Scientist @ allenai.org, and Machine learning Consulting @ data-cowboys.com

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

How To Compile TensorFlow 2.3 with CUDA 11.1

The Amazing Adventures of GOJEK’s Real Data Scientists

6 Research Papers about Machine Learning Deployment Phase

What You Ought To Learn AboutCases https://t.co/FOQMTa09VG

2001: A Data Culture War

project 3: improving medical appointment attendance via appointment classification

5 Computer Vision and Deep Learning Fundamentals

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Sergey Feldman

Sergey Feldman

Senior Applied Research Scientist @ allenai.org, and Machine learning Consulting @ data-cowboys.com

More from Medium

Spiderman: Hero or Foe?

Midterm Reflection

Not The Most Pleasurable Watch, But An Interesting One Nonetheless

The story of a “smart”​ QA inspector who does it the “Google”​ way