chainyo committed
Commit b3349e0 · 1 Parent(s): aac22d6

add 13B model link

Files changed (1): README.md +2 -0
README.md CHANGED
@@ -168,6 +168,8 @@ We benchmarked our model on the following tasks: [BoolQ](https://huggingface.co/
 | LoRA LLaMA 7B | 63.9 | 51.3 | 48.9 | 31.4 | 8bit | 0.65 seconds |
 | LoRA LLaMA 13B | 70 | 63.93 | 51.6 | 50.4 | 8bit | 1.2 seconds |
 
+__Link to the 13B model:__ [wordcab/llama-natural-instructions-13b](https://huggingface.co/wordcab/llama-natural-instructions-13b)
+
 Overall our LoRA model is less performant than the original model from Meta, if we compare the results from the [original paper](https://arxiv.org/pdf/2302.13971.pdf).
 
 The performance degradation is due to the fact we load the model in 8bit and we use the adapters from the LoRA training.