Update README.md
README.md CHANGED

@@ -66,7 +66,9 @@ model-index:
 
 </div>
 
-
+<p align="center" width="100%">
+<img src="https://raw.githubusercontent.com/CStanKonrad/long_llama/main/assets/results.png" alt="LongLLaMA" style="width: 70%; min-width: 300px; display: block; margin: auto;">
+</p>
 
 ## TLDR
 This repository contains the research preview of **LongLLaMA, a large language model capable of handling long contexts of 256k tokens or even more**.
@@ -84,10 +86,6 @@ LongLLaMA Code is built upon the foundation of [Code Llama](https://huggingface.
 with three layers used for context extension. **Crucially, LongLLaMA is able to extrapolate much beyond the context length seen in training: 8k. E.g., in the passkey retrieval task, it can handle inputs of length 256k**.
 **LongLLaMA Code** is a [Code Llama](https://huggingface.co/codellama/CodeLlama-7b-hf) model finetuned with the FoT method.
 
-<p align="center" width="100%">
-<img src="https://raw.githubusercontent.com/CStanKonrad/long_llama/main/assets/results.png" alt="LongLLaMA" style="width: 70%; min-width: 300px; display: block; margin: auto;">
-</p>
-
 
 <div align="center">
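The README text above cites the passkey retrieval task as the benchmark where LongLLaMA handles 256k-token inputs. As a rough illustration of what that task looks like, here is a minimal sketch of a prompt generator for it: a random passkey is hidden inside long filler text and the model is asked to recall it. The function name, filler sentences, and prompt wording are assumptions for illustration, not the actual LongLLaMA evaluation harness.

```python
import random


def make_passkey_prompt(n_garbage: int, seed: int = 0):
    """Build an illustrative passkey-retrieval prompt.

    A random 5-digit passkey is embedded at a random position inside
    `n_garbage` repetitions of filler text; returns the prompt and the
    expected answer. (Hypothetical sketch, not the official harness.)
    """
    rng = random.Random(seed)
    passkey = rng.randint(10000, 99999)
    filler = (
        "The grass is green. The sky is blue. "
        "The sun is yellow. Here we go. There and back again. "
    )
    info = f"The pass key is {passkey}. Remember it. {passkey} is the pass key. "
    pos = rng.randint(0, n_garbage)  # where the passkey is hidden
    prompt = (
        "There is important info hidden inside a lot of irrelevant text. "
        "Find it and memorize it.\n"
        + filler * pos
        + info
        + filler * (n_garbage - pos)
        + "What is the pass key? The pass key is"
    )
    return prompt, str(passkey)


prompt, answer = make_passkey_prompt(n_garbage=100)
```

Sweeping `n_garbage` upward pushes the prompt length toward and beyond the 8k training context, which is how extrapolation claims like the 256k figure are typically probed.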