adol
adol01
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 1 month ago
opencsg/chinese-fineweb-edu-v2
liked
a dataset
2 months ago
BAAI/IndustryCorpus_medicine
new activity
5 months ago
Qwen/Qwen2-1.5B:Hello, is this 1.5B model trained from scratch, or is it distilled like LLaMA 3.2?
Organizations
None yet
adol01's activity
Hello, is this 1.5B model trained from scratch, or is it distilled like LLaMA 3.2?
#7 opened 5 months ago
by
adol01
MMLU Performance After Token Training
#3 opened 5 months ago
by
adol01
Do you plan to open-source the training code?
2
#1 opened 6 months ago
by
adol01