JuStRank: Benchmarking LLM Judges for System Ranking Paper • 2412.09569 • Published Dec 12, 2024 • 19
Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models Paper • 2411.07232 • Published Nov 11, 2024 • 63
Knowledge Navigator: LLM-guided Browsing Framework for Exploratory Search in Scientific Literature Paper • 2408.15836 • Published Aug 28, 2024 • 13
Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models Paper • 2407.19474 • Published Jul 28, 2024 • 23
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP Paper • 2407.00402 • Published Jun 29, 2024 • 22
Evaluating D-MERIT of Partial-annotation on Information Retrieval Paper • 2406.16048 • Published Jun 23, 2024 • 35
Article BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18, 2024 • 43
Make It Count: Text-to-Image Generation with an Accurate Number of Objects Paper • 2406.10210 • Published Jun 14, 2024 • 77
cattana/flan-t5-large-qasem-joint-tokenized Text2Text Generation • Updated Jan 5, 2024 • 38 • 2