adol

adol01

AI & ML interests

None yet

Organizations

None yet

adol01's activity

New activity in Qwen/Qwen2-1.5B 6 months ago

Hello, is this 1.5B model trained from scratch, or is it distilled like LLaMA 3.2?

#7 opened 6 months ago by

New activity in TRI-ML/DCLM-1B 6 months ago

MMLU Performance After Token Training

#3 opened 6 months ago by

New activity in Alibaba-NLP/gte-multilingual-base 8 months ago

Do you plan to open-source the training code?

#1 opened 8 months ago by