Rejected article tracking with the CrossRef API

Adam Day
Jun 20 · 4 min read

CrossRef RAT evaluation

Distributions of Levenshtein distance (t_sim) for correct and incorrect results.
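For readers who have not met the metric before, here is a minimal sketch of a Levenshtein distance in plain Python. The surviving text does not say which strings t_sim compares, or whether it is the raw distance or a normalised score, so the `title_similarity` helper below (which compares titles and rescales to the 0 to 1 range) is an assumption for illustration rather than the calculation used for the figure.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions or
    substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # iterate over the longer string, keep rows short
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or keep) the character
            ))
        previous = current
    return previous[-1]


def title_similarity(query_title: str, result_title: str) -> float:
    """Hypothetical helper: 1.0 for identical titles, 0.0 for completely
    different ones. Normalising by the longer title is an assumption."""
    a = query_title.casefold().strip()
    b = result_title.casefold().strip()
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest
```

Case-folding and stripping whitespace before comparing keeps trivial formatting differences between a preprint title and a journal title from inflating the distance.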
correct_yn is the quantity we are trying to predict: 1.0 for a correct result and 0.0 for an incorrect one. The ‘correct_yn’ row of the matrix shows how it correlates with the other variables. match_all is 1.0 if all author names match on a result and 0.0 otherwise. cr_score is the score CrossRef returns with each result. rank is the position of the result among all results returned by CrossRef. n_days is the number of days the article spent on arXiv before journal publication.
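As a rough illustration of how variables like these could be assembled, the sketch below builds one feature row per search result. Everything about the input is an assumption: each result is a dict with a `score` and an `author` list of `{"family": ...}` entries (the general shape of items in a CrossRef works response), plus a hypothetical pre-parsed `published_date` key. Matching on case-folded family names only, and numbering ranks from 1, are also guesses rather than the method used for the matrix above.

```python
from datetime import date


def family_names(authors):
    """Case-folded set of family names from a CrossRef-style author list."""
    return {
        a["family"].casefold().strip()
        for a in authors
        if a.get("family")
    }


def build_features(query_authors, arxiv_date, results):
    """Return one dict of features per result, in the order received.

    query_authors: family names from the arXiv record
    arxiv_date:    date the preprint appeared on arXiv
    results:       result dicts as described in the text above
    """
    wanted = {name.casefold().strip() for name in query_authors}
    rows = []
    for rank, item in enumerate(results, start=1):  # rank 1 = top result
        found = family_names(item.get("author", []))
        rows.append({
            "rank": rank,
            "cr_score": float(item.get("score", 0.0)),
            # 1.0 only when the two author sets are identical (assumption)
            "match_all": 1.0 if wanted and wanted == found else 0.0,
            "n_days": (item["published_date"] - arxiv_date).days,
        })
    return rows


if __name__ == "__main__":
    # Made-up example data, not real CrossRef output.
    example_results = [
        {"score": 87.5,
         "author": [{"family": "Smith"}, {"family": "Jones"}],
         "published_date": date(2019, 3, 1)},
        {"score": 40.2,
         "author": [{"family": "Smith"}],
         "published_date": date(2018, 12, 1)},
    ]
    for row in build_features(["Smith", "Jones"], date(2019, 1, 1), example_results):
        print(row)
```

A label such as correct_yn would still have to come from a hand-checked or otherwise trusted set of matches; nothing in this sketch produces it.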

The caveats…


Written by

Adam Day

Data scientist working in research communication. #webapps #python #machinelearning #ai
