Renat's picture

7 3

Renat

u-brixton

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

upvoted a paper 4 days ago

Iterative Value Function Optimization for Guided Decoding

View all activity

Organizations

u-brixton's activity

upvoted a paper 3 days ago

KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

Paper • 2503.02951 • Published 5 days ago • 25

upvoted a paper 4 days ago

Iterative Value Function Optimization for Guided Decoding

Paper • 2503.02368 • Published 6 days ago • 14

upvoted a collection 3 months ago

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 24

upvoted 3 collections about 1 year ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 104 items • Updated 4 days ago • 97

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 233

Zeroshot Classifiers

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated Jan 6 • 128

upvoted a collection over 1 year ago

Reward models on the hub

UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13, 2024 • 25