Anikait Singh

Asap7772

https://asap7772.github.io

AI & ML interests

Deep Learning, Reinforcement Learning, Robotics

Recent Activity

updated a dataset about 3 hours ago

Asap7772/metamath-rl-hint-random-4

published a dataset about 3 hours ago

Asap7772/metamath-rl-hint-random-4

updated a dataset about 3 hours ago

Asap7772/metamath-rl-hint-random-2

Papers 9

arxiv:2503.01307

arxiv:2502.19312

arxiv:2501.04682

arxiv:2410.02725

Models 56

Asap7772/smollm2_lr1e4_10ep_binned_sft

Text Generation • Updated 3 days ago • 5

Asap7772/smollm2_lr1e4_7ep_binned_sft

Text Generation • Updated 3 days ago • 4

Asap7772/smollm2_lr1e4_5ep_binned_sft

Text Generation • Updated 3 days ago • 7

Asap7772/smollm2_lr1e4_2ep_binned_sft

Text Generation • Updated 3 days ago • 6

Asap7772/smollm2_lr5e5_10ep_binned_sft

Text Generation • Updated 3 days ago • 6

Asap7772/smollm2_lr5e5_7ep_binned_sft

Text Generation • Updated 3 days ago • 5

Asap7772/smollm2_lr5e5_5ep_binned_sft

Text Generation • Updated 3 days ago • 6

Asap7772/smollm2_lr5e5_2ep_binned_sft

Text Generation • Updated 3 days ago • 6

Asap7772/smollm2_lr1e5_10ep_binned_sft

Text Generation • Updated 3 days ago • 4

Asap7772/smollm2_lr1e5_7ep_binned_sft

Text Generation • Updated 3 days ago • 9

Datasets 1112

Asap7772/metamath-rl-hint-random-4

Viewer • Updated about 3 hours ago • 104k

Asap7772/metamath-rl-hint-random-2

Updated about 3 hours ago • 62

Asap7772/metamath-rl-hint-topk-4

Viewer • Updated about 4 hours ago • 104k • 59

Asap7772/metamath-rl-hint-topk-2

Viewer • Updated about 4 hours ago • 51.9k • 59

Asap7772/metamath-rl-hint-no_hint-2

Viewer • Updated about 4 hours ago • 26k • 60

Asap7772/metamath-hint-sft-rand-2-proc

Viewer • Updated about 4 hours ago • 18.4k • 62

Asap7772/metamath-hint-sft-rand-4-proc

Viewer • Updated about 4 hours ago • 26.2k

Asap7772/metamath-hint-sft-topk-4-proc

Viewer • Updated about 4 hours ago • 26.2k

Asap7772/metamath-hint-sft-topk-2-proc

Viewer • Updated about 4 hours ago • 18.4k • 51

Asap7772/metamath-hint-sft-nohint-proc

Viewer • Updated about 4 hours ago • 28.5k • 55