Parse judgments with structured output prompting, one response model, one judge model at a time. eb4ec23 justinxzhao committed on Oct 3, 2024
Some refactoring, judging responses for direct assessment. 577870e justinxzhao committed on Sep 29, 2024