
Atropos Health’s Arjun Mukerji, PhD, Explains RWESummary: A Framework and Test for Choosing LLMs to Summarize Real-World Evidence (RWE) Studies
Deep Papers
00:00
Benchmark Design, Metrics, and Model Results
Explains the RWE Summary benchmark: three LLM-jury evaluations (direction, numbers, completeness), weighting choices prioritizing direction of effect, the models tested, and the evaluation outcomes.
Transcript
Play full episode