Code LLM graded evaluation with GitLab GPU runners
Measuring the quality of LLM-generated output is a complex task with direct societal relevance: sound evaluation helps mitigate bias and harm and improve accuracy. This POC pioneers LLM evaluation within GitLab MLOps.