There is a long tail of outputs with 1 citation, which we can assume to be a citation of the output in the original paper describing the work by the same authors. Similarly, we can infer that outputs with more than 1 citation, cited in publication(s) by different authors, have been reused. Here are some other interesting preliminary findings.
4 out of the top 10 are software with a mean citation count of 50.5!
The code repositories have a few things in common.
- Each has a README file
- Each has multiple versions
- Each has lots of metadata associated with it. Sometimes this description is includeddirectly on the landing page, as is the case with PIVlab – Time-Resolved Digital Particle Image Velocimetry Tool for MATLAB (https://doi.org/10.6084/m9.figshare.1092508). For others, we find context to the work on other sites that link directly to the output, which are picked up by the Altmetric Explorer tool. Whether it be a Wikipedia article describing the software, as is the case for Common Workflow Language, v1.0 ( https://doi.org/10.6084/m9.figshare.3115156.v2) or blog posts that describe the The graph-tool python library (https://doi.org/10.6084/m9.figshare.1164194.v14) or PyMKS: Materials Knowledge System in Python (https://doi.org/10.6084/m9.figshare.1015761.v2)
Interestingly, there are 12 outputs labelled by authors as software in the top 100, despite software making up just 1.6% of outputs on figshare.com. This suggests that software can be highly reused and the impact it is having is not being captured in traditional credit mechanisms.
32 out of the top 100 are datasets
As of today, outputs labelled by authors as datasets make up 30% of outputs on Figshare.com. Therefore, finding that 32% of the top 100 cited outputs are datasets seems logical.
Obviously with the many different types of outputs that researchers want credit for, there are some that lend themselves to reuse and traditional citation metrics better than others. Notably, posters and presentations rarely get cited. But this does not mean that it isn’t happening. There is 1 presentation and 2 posters in the 100 most cited outputs.
The author of said presentation has picked up multiple citations over multiple different outputs, suggesting that the credibility of the author and their work is a key factor in driving impact.
Soon Figshare’s newly launched faceted search functionality will also include the ability to filter search results by citation count and Altmetric score in any Figshare portal. We’re excited about the new ways to encourage researchers to share all of their research outputs and make sure they get credit for the work. We look forward to continuing to investigate patterns of data reuse and working with the community on methods to make all the products of research more reusable.
Related posts
- « Previous
- 1
- …
- 8
- 9
- 10