6 35 53

Lê Võ Quyết Thắng

thangvip

AI & ML interests

Adapting LLM to specific domain

Recent Activity

liked a dataset about 7 hours ago

yuyijiong/patient-math-cot

upvoted a paper about 7 hours ago

Natural Language Reinforcement Learning

upvoted a paper about 7 hours ago

Hymba: A Hybrid-head Architecture for Small Language Models

View all activity

Organizations

thangvip's activity

liked a dataset about 7 hours ago

yuyijiong/patient-math-cot

Viewer • Updated 5 days ago • 5.01k • 16 • 2

upvoted 2 papers about 7 hours ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published 5 days ago • 25

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published 6 days ago • 35

upvoted a collection about 22 hours ago

Tulu 3 Datasets

Collection

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 5 days ago • 42

upvoted a paper 1 day ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published 4 days ago • 46

liked a Space 1 day ago

Running on CPU Upgrade

11.9k

🏆

Open LLM Leaderboard 2

Track, rank and evaluate open LLMs and chatbots

liked a model 1 day ago

google/gemma-2-2b-it

Text Generation • Updated Aug 27 • 881k • • 693

liked a dataset 1 day ago

HuggingFaceTB/smoltalk

Viewer • Updated about 4 hours ago • 2.2M • 1.04k • 134

upvoted a paper 5 days ago

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 101

upvoted a paper 6 days ago

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published 15 days ago • 30

liked a dataset 9 days ago

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated 26 days ago • 1.05M • 3.2k • 361

updated a model 15 days ago

thangvip/vlama-1b-instruct

Text Generation • Updated 15 days ago • 10

updated a collection 23 days ago

Vilawqa evaluation datasets

Collection

Datasets for evaluating vilaw model • 5 items • Updated 23 days ago

updated a dataset 23 days ago

thangvip/vilawqa

Viewer • Updated 23 days ago • 1.02k • 42

upvoted a paper 30 days ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22 • 88

updated a model about 1 month ago

thangvip/vlama-1b

Text Generation • Updated Oct 24 • 15

liked a model about 1 month ago

togethercomputer/evo-1-131k-base

Text Generation • Updated 8 days ago • 14.7k • 91

updated a dataset about 1 month ago

AIForge/tokenized_ds_clm_vlama

Viewer • Updated Oct 18 • 285k • 31

upvoted 2 papers about 1 month ago

From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning

Paper • 2410.06456 • Published Oct 9 • 35

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment

Paper • 2410.08193 • Published Oct 10 • 3