User: Guest  Login
Title:

Autonomous medical evaluation for guideline adherence of large language models.

Document type:
Journal Article
Author(s):
Fast, Dennis; Adams, Lisa C; Busch, Felix; Fallon, Conor; Huppertz, Marc; Siepmann, Robert; Prucker, Philipp; Bayerl, Nadine; Truhn, Daniel; Makowski, Marcus; Löser, Alexander; Bressem, Keno K
Abstract:
Autonomous Medical Evaluation for Guideline Adherence (AMEGA) is a comprehensive benchmark designed to evaluate large language models' adherence to medical guidelines across 20 diagnostic scenarios spanning 13 specialties. It includes an evaluation framework and methodology to assess models' capabilities in medical reasoning, differential diagnosis, treatment planning, and guideline adherence, using open-ended questions that mirror real-world clinical interactions. It includes 135 questions and...     »
Journal title abbreviation:
NPJ Digit Med
Year:
2024
Journal volume:
7
Journal issue:
1
Fulltext / DOI:
doi:10.1038/s41746-024-01356-6
Pubmed ID:
http://view.ncbi.nlm.nih.gov/pubmed/39668168
TUM Institution:
Institut für Diagnostische und Interventionelle Radiologie (Prof. Makowski)
 BibTeX