Identification of human gene research articles with wrongly identified nucleotide sequences - IRIT - Institut de Recherche en Informatique de Toulouse Accéder directement au contenu
Article Dans Une Revue Life Science Alliance Année : 2022

Identification of human gene research articles with wrongly identified nucleotide sequences

Résumé

Nucleotide sequence reagents underpin molecular techniques that have been applied across hundreds of thousands of publications. We have previously reported wrongly identified nucleotide sequence reagents in human research publications and described a semi-automated screening tool Seek & Blastn to fact-check their claimed status. We applied Seek & Blastn to screen >11,700 publications across five literature corpora, including all original publications in Gene from 2007 to 2018 and all original open-access publications in Oncology Reports from 2014 to 2018. After manually checking Seek & Blastn outputs for >3,400 human research articles, we identified 712 articles across 78 journals that described at least one wrongly identified nucleotide sequence. Verifying the claimed identities of >13,700 sequences highlighted 1,535 wrongly identified sequences, most of which were claimed targeting reagents for the analysis of 365 human protein-coding genes and 120 non-coding RNAs. The 712 problematic articles have received >17,000 citations, including citations by human clinical trials. Given our estimate that approximately one-quarter of problematic articles may misinform the future development of human therapies, urgent measures are required to address unreliable gene research articles.
Fichier principal
Vignette du fichier
ParkEtAl2022.pdf (2.06 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03523959 , version 1 (12-01-2022)

Licence

Paternité

Identifiants

Citer

Yasunori Park, Rachael West, Pranujan Pathmendra, Bertrand Favier, Thomas Stoeger, et al.. Identification of human gene research articles with wrongly identified nucleotide sequences. Life Science Alliance, 2022, 5 (4), pp.e202101203. ⟨10.26508/lsa.202101203⟩. ⟨hal-03523959⟩
531 Consultations
89 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More