Title:

Autonomous medical evaluation for guideline adherence of large language models.

Document type:
Journal Article
Author(s):
Fast, Dennis; Adams, Lisa C; Busch, Felix; Fallon, Conor; Huppertz, Marc; Siepmann, Robert; Prucker, Philipp; Bayerl, Nadine; Truhn, Daniel; Makowski, Marcus; Löser, Alexander; Bressem, Keno K
Abstract:
Autonomous Medical Evaluation for Guideline Adherence (AMEGA) is a comprehensive benchmark designed to evaluate large language models' adherence to medical guidelines across 20 diagnostic scenarios spanning 13 specialties. It includes an evaluation framework and methodology to assess models' capabilities in medical reasoning, differential diagnosis, treatment planning, and guideline adherence, using open-ended questions that mirror real-world clinical interactions. It includes 135 questions and...
Journal title:
NPJ Digit Med
Year:
2024
Volume:
7
Issue:
1
Full text / DOI:
doi:10.1038/s41746-024-01356-6
PubMed:
http://view.ncbi.nlm.nih.gov/pubmed/39668168
TUM institution:
Institut für Diagnostische und Interventionelle Radiologie (Prof. Makowski)