13 25 65

Asaf Yehudai

Asaf-Yehudai

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

authored a paper 7 days ago

When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes

authored a paper 7 days ago

Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation

View all activity

Organizations

Asaf-Yehudai's activity

upvoted a paper 5 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 10 days ago • 69

authored 4 papers 7 days ago

When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes

Paper • 2404.12365 • Published Apr 18 • 1

Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation

Paper • 2407.13696 • Published Jul 18 • 5

Selective Self-Rehearsal: A Fine-Tuning Approach to Improve Generalization in Large Language Models

Paper • 2409.04787 • Published Sep 7

JuStRank: Benchmarking LLM Judges for System Ranking

Paper • 2412.09569 • Published 10 days ago • 18

liked a model 7 days ago

Nexusflow/Athene-V2-Chat

Text Generation • Updated 26 days ago • 7k • 247

liked a Space 9 days ago

Running

🧑🏻‍⚖️

JuStRank

upvoted a paper 10 days ago

JuStRank: Benchmarking LLM Judges for System Ranking

Paper • 2412.09569 • Published 10 days ago • 18

upvoted a paper 11 days ago

Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI

Paper • 2401.14019 • Published Jan 25 • 21

liked a Space 11 days ago

Sleeping

💻

JuStRank

upvoted 2 collections 29 days ago

multimodal

Collection

194 items • Updated 2 days ago • 6

VisionLM

Collection

561 items • Updated 3 days ago • 39

upvoted a paper about 1 month ago

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published May 2 • 52

upvoted a paper about 2 months ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20 • 11

liked a model about 2 months ago

stabilityai/stable-diffusion-3.5-large

Text-to-Image • Updated Oct 22 • 169k • • 1.66k

liked a dataset about 2 months ago

math-ai/TemplateGSM

Viewer • Updated 24 days ago • 11.6M • 562 • 13

liked a model about 2 months ago

genmo/mochi-1-preview

Text-to-Video • Updated 4 days ago • 31.8k • 1.12k

liked a dataset 2 months ago

google/frames-benchmark

Viewer • Updated Oct 15 • 824 • 1.63k • 176

upvoted a paper 2 months ago

LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Paper • 2410.10783 • Published Oct 14 • 26

liked a model 2 months ago

rhymes-ai/Aria

Image-Text-to-Text • Updated 5 days ago • 18.6k • 598