
Evaluating Large Language Models for Fair and Reliable Organ Allocation
Static evaluations are very brittle—especially concerning for extremely high-steaks medical settings like organ allocations. Turns out, simply asking LLMs to rank recipients (which is what actual organ transplant committees do) breaks apparent LLM fairness.

