-
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs
Paper • 2402.12030 • Published -
mistralai/Mistral-7B-Instruct-v0.2
Text Generation • Updated • 767k • 2.39k -
meta-llama/Llama-2-7b-chat-hf
Text Generation • Updated • 1,000k • 3.65k -
EleutherAI/pythia-160m-deduped
Text Generation • Updated • 25.3k • 3
Nicolas-BZRD
Nicolas-BZRD
AI & ML interests
PhD Student | NLP - LLMs - Adaptation real-world problem
Optimization
Organizations
Collections
1
models
92
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/W_emoC2uItM-MJZyCfIKI.png)
Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_uld_loss
Text2Text Generation
•
Updated
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/W_emoC2uItM-MJZyCfIKI.png)
Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_text_teacher
Text2Text Generation
•
Updated
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/W_emoC2uItM-MJZyCfIKI.png)
Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_uld_loss
Text2Text Generation
•
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/W_emoC2uItM-MJZyCfIKI.png)
Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_text_teacher
Text2Text Generation
•
Updated
•
11
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/W_emoC2uItM-MJZyCfIKI.png)
Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_uld_loss
Text2Text Generation
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/W_emoC2uItM-MJZyCfIKI.png)
Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_text_teacher
Text2Text Generation
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/W_emoC2uItM-MJZyCfIKI.png)
Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_uld_loss
Text2Text Generation
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/W_emoC2uItM-MJZyCfIKI.png)
Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_text_teacher
Text2Text Generation
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/W_emoC2uItM-MJZyCfIKI.png)
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_uld_loss
Text Generation
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/W_emoC2uItM-MJZyCfIKI.png)
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_text_teacher
Text Generation
•
Updated
datasets
30
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-pubmed_qa_50k
Viewer
•
Updated
•
50.5k
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-squad
Viewer
•
Updated
•
87.6k
•
1
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-squad
Viewer
•
Updated
•
87.6k
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-dialogsum
Viewer
•
Updated
•
12.4k
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-qed
Viewer
•
Updated
•
7.62k
•
1
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-FairytaleQA
Viewer
•
Updated
•
9.57k
•
2
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-FairytaleQA
Viewer
•
Updated
•
9.57k
•
3
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-dialogsum
Viewer
•
Updated
•
13k
•
2
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-qed
Viewer
•
Updated
•
7.62k
•
1
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-pubmed_qa_50k
Viewer
•
Updated
•
50.5k
•
4