Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 By dvilasuero • Jul 30 • 37
PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking Paper • 2410.12375 • Published Oct 16 • 2
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8 • 22