Hello, is this 1.5B model trained from scratch, or is it distilled like LLaMA 3.2?

#7
by adol01 - opened

xiezuo20240926-144824.png
Hope to receive a reply

Sign up or log in to comment