Highly accurate classification of chest radiographic reports using a deep learning natural language model pre-trained on 3.8 million text reports.

Bressem, Keno K; Adams, Lisa C; Gaudin, Robert A; Tröltzsch, Daniel; Hamm, Bernd; Makowski, Marcus R; Schüle, Chan-Yong; Vahldiek, Janis L; Niehues, Stefan M

doi:10.1093/bioinformatics/btaa668

2021

Zurück
Zurück zum Anfang der Trefferliste
Dauerhafter Link zum angezeigten Objekt

Titel:: Highly accurate classification of chest radiographic reports using a deep learning natural language model pre-trained on 3.8 million text reports.
Dokumenttyp:: Journal Article
Autor(en):: Bressem, Keno K; Adams, Lisa C; Gaudin, Robert A; Tröltzsch, Daniel; Hamm, Bernd; Makowski, Marcus R; Schüle, Chan-Yong; Vahldiek, Janis L; Niehues, Stefan M
Abstract:: MOTIVATION: The development of deep, bidirectional transformers such as Bidirectional Encoder Representations from Transformers (BERT) led to an outperformance of several Natural Language Processing (NLP) benchmarks. Especially in radiology, large amounts of free-text data are generated in daily clinical workflow. These report texts could be of particular use for the generation of labels in machine learning, especially for image classification. However, as report texts are mostly unstructured, advanced NLP methods are needed to enable accurate text classification. While neural networks can be used for this purpose, they must first be trained on large amounts of manually labelled data to achieve good results. In contrast, BERT models can be pre-trained on unlabelled data and then only require fine tuning on a small amount of manually labelled data to achieve even better results. RESULTS: Using BERT to identify the most important findings in intensive care chest radiograph reports, we achieve areas under the receiver operation characteristics curve of 0.98 for congestion, 0.97 for effusion, 0.97 for consolidation and 0.99 for pneumothorax, surpassing the accuracy of previous approaches with comparatively little annotation effort. Our approach could therefore help to improve information extraction from free-text medical reports. Availability and implementationWe make the source code for fine-tuning the BERT-models freely available at https://github.com/fast-raidiology/bert-for-radiology. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. «
MOTIVATION: The development of deep, bidirectional transformers such as Bidirectional Encoder Representations from Transformers (BERT) led to an outperformance of several Natural Language Processing (NLP) benchmarks. Especially in radiology, large amounts of free-text data are generated in daily clinical workflow. These report texts could be of particular use for the generation of labels in machine learning, especially for image classification. However, as report texts are mostly unstructured, a... »
Zeitschriftentitel:: Bioinformatics
Jahr:: 2021
Band / Volume:: 36
Heft / Issue:: 21
Seitenangaben Beitrag:: 5255-5261
Volltext / DOI:: doi:10.1093/bioinformatics/btaa668
PubMed:: http://view.ncbi.nlm.nih.gov/pubmed/32702106
Print-ISSN:: 1367-4803
TUM Einrichtung:: Institut für Diagnostische und Interventionelle Radiologie
BibTeX

Vorkommen:

mediaTUM Gesamtbestand Hochschulbibliographie 2021 Fakultäten Medizin Institut für Radiologie

mediaTUM Gesamtbestand Einrichtungen Schools TUM School of Medicine and Health Departments Clinical Medicine Institut für Diagnostische und Interventionelle Radiologie (Prof. Makowski)2021