Data collection and analysis
Three authors (A.S., S.T., and A. Su.) first conducted a calibration exercise with 50 randomly selected records. We reviewed our categorizations and identified any areas of disagreement to refine our coding process. Then two authors (A.S. and S.T.) independently and in duplicate assessed and categorized each citation against the index citation using the definition of each outcome. These categorizations were reconciled and any conflicts were discussed with a third author (A. Su.). Metadata were also collected directly from WoS. These included research area, article type, country of origin, and funding sources for each article.
Proportions between categories were compared using Chi-squared or Fisher Exact tests. All tests were two sided and p<0.05 was considered statistically significant.