Excellent models! Plans for Mistral Nemo and/or Gemma 2 distills?
#14 opened about 9 hours ago
by
DavidAU
Adding Evaluation Results
#12 opened 6 days ago
by
Mikhil-jivus
Missing multilingual capabilities
5
#11 opened 7 days ago
by
h4rz3rk4s3
E-MOBI / EKONOMIK MOBIL
#10 opened 10 days ago
by
jesus-christ666
Run in Colab on a T4
#9 opened 11 days ago
by
rakmik
Adding Evaluation Results
#8 opened 11 days ago
by
T145
Add pipeline tag, link to paper
#7 opened 14 days ago
by
nielsr
Do the distilled models also have 128K context?
1
#4 opened 17 days ago
by
Troyanovsky
How was this quantized?
1
#3 opened 17 days ago
by
imq
Missing special_tokens_map.json file
#2 opened 17 days ago
by
vince62s