During the Rubber-meets-the-road panel at the 2019 STM Conference in London, Anita Bandrowski and Martijn Roelandse presented new means to measure transparency and reproducibility of biomedical journals.
These means are based on SciScore , a tool that can evaluate whether the authors have addressed blinding, sex, and randomization of subjects into groups, power analysis, as well as key resources. These are all difficult and tedious things for humans to check, but critical if we want to measure and ultimately improve the quality of the science being conducted and published.
The first results have been presented in London.
2. Reproducibility
crisis
the effect size of a poorly controlled study is about 50% bigger than the effect size of
a well controlled study.
Is it possible that poorly controlled animal studies are repeated using proper controls
in clinical trials and fail because the effects were never significant to begin with?
5. Sowhere
arewe now?
NIF, INCF, members of the NIH, and about 25 major journal Editors in Chief,
began to talk about research resource reproducibility
2012: 1st meeting at the Commander's Palace @ Society for Neuroscience
2013: 2nd meeting at NIH
2014: Pilot project started; 25 journals would ask authors to provide RRIDs
for 3 months, 2 journals started on time
we are currently in
5thyear of a
3-month pilot
6. RRIDs = Better papers
Bandrowski et al,
2015a,b,c,d
Data is based
on the RRID
pilot, first 100
papers
RRIDs=Betterpapers
Control:
n=
150,459
RRID:
n=634
Babic et al, eLife,
2019
66% decrease
in naughty cell
lines with
RRID
7. Nextstep:
SciSCore- thetoolthat
makesRRIDsa reality
SciScore checks whether the authors address sex, blinding,
randomization of subjects into groups, power analysis, as well as key
resources.
The tool produces a score that roughly corresponds to the number
of criteria filled in vs the number that were expected.
8. try this today @ sciscore.com
free version via ORCID
SciScore.comisfreelyaccessibleforauthorsandit
isintendedtoimprovemanuscripts
Free
Trial
9. Copy methods section,
paste into sciscore.com
to create a report
SciScoretakesasinput themethods
sectionof manuscripts
10. The score is a 5 out of 10
The rigor table pulls sentences from
the methods section that fit the criteria.
For example, in this paper SciScore
detected that power analysis was
present. +1
Statements on Blinding or Cell Line
Authentication were not detected by
SciScore. +0
Authors
sentence
detected
TheSciScoreReportcontains2tables:
Rigor&Resources
11. The resources table
pulls sentences from the
methods section that
contain some resource,
organized by type.
When information
matches the wrong
identifier or a
problematic resource
SciScore warns authors.
Expected
Information is
recognized
(+1)
Expected
Information is
missing (+0)
Expected
Information is
missing but
retrievable
TheSciScoreReportcontains2tables:
Rigor&Resources
12. ~30 algorithms that work in concert to
identify named entities
classify papers / sections
Lookup tables for reagents
Classifier types used:
neural networks
standard NER
POS, sentence diagrams
Reports are assembled by rules,
if a cell line is detected -> detect cell line authentication
If a cell line is contaminated -> red error message
SciScore-howitwasMADE
13. Step 1: annotate sentences:
Step 2: algorithm training
Step 3: check different sentences
SciScore-howitwasMADE
Classifier Type F1 Precis. Recall Training Set Size
Rigor Criteria
Institutional Review Board 76.9 88.2 68.2 340
Consent Statement 96.8 97.8 95.7 373
Animal Care Statement 77.9 82.2 74.0 591
Randomization of subjects 80.6 86.2 75.8 368
Blinding of investigator or analysis 96.3 100 92.9 183
Power analysis for group size 90.9 83.3 100 81
Sex as a biological variable 92.6 98.9 87.0 862
Cell Line Authentication 66.7 76.9 58.8 155
Cell Line Contamination 85.7 90.0 81.8 151
Key Biological Resources
Antibody 78.8 87.2 71.9 16,772
Organism 71.6 81.6 63.8 4,439
Cell Line 72.1 79.2 66.1 1,763
Software Project/Tool 89.8 94.1 85.8 10,161
Sentence 2 (methods sentence line 125; PMID:28638484)
For cellular uptake kinetics study, HeLa (false negative) or
RAW264.7 (correct annotation) cells were seeded into 96-
well plates and allowed to attach for 24h.
Sentence 1 (methods sentence line 353; PMID:26012578)
For luciferase activity assays, HeLa or HCN-A94 cells were
grown in 24 well plates and transfected with 0.1 亮g phRL-
TK-10BOXB plasmid, 0.1 亮g of pGL3 promoter plasmid and
with 0.7 亮g of one of the six pCl- 了N-HA-tagged UPF3B
expression constructs.
Cell Line
19. Outlook
**Coming soon**
Additional MDAR support
eJournal Press Integration
Aries Integration
Aggregation of scores on university / funder / researcher level
Exploring integration with other disciplines / tools
Current Pilots:
British Journal of Pharmacology (8 mos/2019 SciScore: 6.28)
Brain & Behavior (5 mos/2019 SciScore: 5.46)
10 Springer Journals *New Pilot*
eLife *New Pilot*
20. councilorofA USUniversity
Did you check how
MIT is doing in your
analysis? I bet theyre
worse than we are.
try this today @ sciscore.com
free version via ORCID