Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering Paper โข 2411.11504 โข Published 8 days ago โข 18
Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning Paper โข 2410.22304 โข Published 27 days ago โข 15
Training Language Models to Self-Correct via Reinforcement Learning Paper โข 2409.12917 โข Published Sep 19 โข 135
Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning Paper โข 2410.22304 โข Published 27 days ago โข 15
Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning Paper โข 2410.22304 โข Published 27 days ago โข 15 โข 2
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI Paper โข 2410.11096 โข Published Oct 14 โข 12
MIRAI: Evaluating LLM Agents for Event Forecasting Paper โข 2407.01231 โข Published Jul 1 โข 16 โข 3
Enhancing Large Vision Language Models with Self-Training on Image Comprehension Paper โข 2405.19716 โข Published May 30
view post Post 1248 Reply Check out our new benchmark paper on LLM agents for global events forecasting! MIRAI: Evaluating LLM Agents for Event Forecasting (2407.01231) ๐ Arxiv: https://arxiv.org/abs/2407.01231๐ Project page: https://mirai-llm.github.io๐ป GitHub Repo: https://github.com/yecchen/MIRAI๐ Dataset: https://drive.google.com/file/d/1xmSEHZ_wqtBu1AwLpJ8wCDYmT-jRpfrN/view?usp=sharing๐ Interactive Demo Notebook: https://colab.research.google.com/drive/1QyqT35n6NbtPaNtqQ6A7ILG_GMeRgdnO?usp=sharing โค๏ธ 2 2 +
view post Post 1248 Reply Check out our new benchmark paper on LLM agents for global events forecasting! MIRAI: Evaluating LLM Agents for Event Forecasting (2407.01231) ๐ Arxiv: https://arxiv.org/abs/2407.01231๐ Project page: https://mirai-llm.github.io๐ป GitHub Repo: https://github.com/yecchen/MIRAI๐ Dataset: https://drive.google.com/file/d/1xmSEHZ_wqtBu1AwLpJ8wCDYmT-jRpfrN/view?usp=sharing๐ Interactive Demo Notebook: https://colab.research.google.com/drive/1QyqT35n6NbtPaNtqQ6A7ILG_GMeRgdnO?usp=sharing โค๏ธ 2 2 +
MIRAI: Evaluating LLM Agents for Event Forecasting Paper โข 2407.01231 โข Published Jul 1 โข 16 โข 3