leukas 's Collections

CUTE

The CUTE benchmark is an LLM benchmark, testing LLMs' understanding of orthography. Check out our github here: https://github.com/Leukas/cute