Instruction Following without Instruction Tuning Paper β’ 2409.14254 β’ Published 6 days ago β’ 6 β’ 3
WebApp1K: A Practical Code-Generation Benchmark for Web App Development Paper β’ 2408.00019 β’ Published Jul 30 β’ 1
A Case Study of Web App Coding with OpenAI Reasoning Models Paper β’ 2409.13773 β’ Published 9 days ago β’ 4 β’ 2
Insights from Benchmarking Frontier Language Models on Web App Code Generation Paper β’ 2409.05177 β’ Published 19 days ago β’ 5 β’ 3