Daily Driver's/ Current Favorite's Collection Smart, great at rp. What more do I say? • 2 items • Updated Nov 4 • 11
Article 🚨 ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming By sted97 • Jun 25 • 4
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Paper • 2412.14161 • Published 5 days ago • 43
Accelerated Preference Optimization for Large Language Model Alignment Paper • 2410.06293 • Published Oct 8 • 5
Canonical models Collection This collection lists all the historical (pre-"Hub") canonical model checkpoints, i.e. repos that were not under an org or user namespace • 68 items • Updated Feb 13 • 14
ELM Collection Collection of various ELM models from "Erasing Conceptual Knowledge from Language Models" • 4 items • Updated Oct 21 • 2
SimPO Collection This collection contains a list of SimPO and baseline models. • 49 items • Updated Nov 7 • 17
WPO Collection Models and datasets in paper "WPO: Enhancing RLHF with Weighted Preference Optimization". • 11 items • Updated Aug 22 • 5
Gemma-2-9B-it-Advanced Collection Merges of the advanced research fine tunes of gemma-2 9B it • 3 items • Updated Oct 20 • 1
An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models Paper • 2403.06764 • Published Mar 11 • 26
How Far Are We from Intelligent Visual Deductive Reasoning? Paper • 2403.04732 • Published Mar 7 • 19