- AI · arxiv/cs.LG · 8 min
LLM Panels Match Expert Clinicians in Medical Diagnosis Scoring
A study of three frontier AI models scoring real hospital cases shows calibrated LLM juries can reliably replace human expert panels for medical AI evaluation.
April 17, 2026 Read → → - AI · arxiv/cs.LG · 4 min
Retrieval-Augmented Set Completion for Clinical Code Authoring
A two-stage approach retrieves similar clinical value sets then classifies candidates, outperforming direct LLM generation on standardized medical vocabularies.
April 17, 2026 Read → → - AI · arxiv/cs.LG · 4 min
Retrieval beats memorization for clinical code selection
A two-stage retrieval-then-classify method outperforms direct LLM generation for assembling clinical value sets from large standardized vocabularies.
April 17, 2026 Read → →