Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 By dvilasuero • Jul 30, 2024 • 37
PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking Paper • 2410.12375 • Published Oct 16, 2024 • 3
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 23
chouPAPI21/Llama-3.2-11B-Vision-Instruct-educational-assistant-w-ocr-docparsing Text2Text Generation • Updated Oct 21, 2024 • 4
ScalableMath/llemma-7b-orm-prm800k-level-1to3-hf Text Generation • Updated Mar 1, 2024 • 35 • 1