RELIABLE AND DIVERSE EVALUATION OF LLM MEDICAL KNOWLEDGE MASTERY
该文章于2025年发表在ICLR(CCF A),早在2024年9月发布在arxiv。
文章地址:Reliable and Diverse Evaluation of LLM Medical Knowledge Mastery
arXiv:[2409.14302] Reliable and diverse evaluation of LLM medical knowledge mastery
代码仓库:GitHub - THUMLP/PretexEval: Codes and datasets of the ICLR 2025 paper: Reliable and Diverse Evaluation of LLM Medical Knowledge Mastery
openreview打分: 6 8 8 6
一、概述