Add library name, pipeline tag, paper link, and Github link #1

opened by nielsr (HF staff)

README.md CHANGED
```diff
@@ -1,13 +1,19 @@
 ---
-license: llama3
-datasets:
-- BAAI/Infinity-Instruct
 base_model:
 - meta-llama/Meta-Llama-3.1-8B-Instruct
+datasets:
+- BAAI/Infinity-Instruct
+license: apache-2.0
+library_name: transformers
+pipeline_tag: text-generation
 ---
 
 We prune Llama-3.1-8B-Instruct to 1.4B parameters and fine-tune it with the LLM-Neo method, which combines LoRA and KD. The training data is 1 million lines sampled from BAAI/Infinity-Instruct.
 
+For more information, please refer to the paper: [LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models](https://huggingface.co/papers/2411.06839)
+
+Code can be found here: https://github.com/yang3121099/LLM-Neo
+
 ## Benchmarks
 
 In this section, we report the results for Llama3.1-Neo-1B-100w on standard automatic benchmarks. For all evaluations, we use the [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) library.
```
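The added `library_name: transformers` and `pipeline_tag: text-generation` metadata are what let the Hub surface the standard text-generation widget and usage snippet for this checkpoint. A minimal sketch of that usage, assuming the model loads with the stock `transformers` API (the repository id below is a placeholder, not taken from this PR):

```python
# Minimal text-generation usage implied by the new metadata.
# NOTE: the repo id is a placeholder; substitute the actual model repository.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="<org>/Llama3.1-Neo-1B-100w",
    device_map="auto",
)

output = generator(
    "Explain knowledge distillation in one sentence.",
    max_new_tokens=64,
    do_sample=False,
)
print(output[0]["generated_text"])
```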
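The README summarizes LLM-Neo as LoRA combined with knowledge distillation (KD) on a pruned student; the linked paper is the authoritative reference for the recipe. The sketch below only illustrates the general LoRA-plus-KD idea, assuming a `peft` LoRA adapter on the student and a temperature-scaled KL term against the frozen teacher. The student path, LoRA hyperparameters, and loss weighting are illustrative assumptions, not the authors' implementation.

```python
# Illustrative LoRA + KD training step (a sketch of the general idea, not LLM-Neo reference code).
import torch
import torch.nn.functional as F
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Frozen teacher: the original instruct model named in the README front matter.
teacher = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct")
teacher.eval()

# Student: the pruned 1.4B model; the path here is a placeholder.
student = AutoModelForCausalLM.from_pretrained("<path-to-pruned-1.4B-student>")
student = get_peft_model(
    student,
    LoraConfig(r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM"),
)

def neo_kd_step(batch, alpha=0.5, temperature=2.0):
    """Combined loss: cross-entropy on the labels plus KL divergence to the teacher."""
    with torch.no_grad():
        teacher_logits = teacher(**batch).logits
    out = student(**batch, labels=batch["input_ids"])  # .loss is the usual CE term
    kd = F.kl_div(
        F.log_softmax(out.logits / temperature, dim=-1),
        F.log_softmax(teacher_logits / temperature, dim=-1),
        log_target=True,
        reduction="batchmean",
    ) * temperature**2
    return alpha * out.loss + (1.0 - alpha) * kd
```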
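For the benchmark numbers, the README points to lm-evaluation-harness. A sketch of driving such an evaluation from Python, assuming the v0.4+ `simple_evaluate` API; the task list and repository id are placeholders rather than the exact setup behind the reported results:

```python
# Illustrative lm-evaluation-harness run; task names and repo id are placeholders.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=<org>/Llama3.1-Neo-1B-100w,dtype=bfloat16",
    tasks=["hellaswag", "arc_easy"],
    batch_size=8,
)
print(results["results"])
```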