xiaol
/

rwkv-7B-world-novel-128k

Model card Files Files and versions Community

xiaol commited on Aug 10, 2023

Commit

0f64ecb

•

1 Parent(s): 87c0275

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ With RWKV world tokenizer,multi-langs have 1:1 tokenization ratio ,one word to o
 This model trained with instructions datasets and chinese web novel and tradition wuxia,
 more trainning details would be updated.
-Test input 67k tokens  to summary ,can find conversation files in example folders ,more cases are coming.
 Full finetuned using this repo to train 128k context model , 4*A800 40hours with 1.3B tokens.
 https://github.com/SynthiaDL/TrainChatGalRWKV/blob/main/train_world.sh
@@ -35,7 +35,7 @@ Using RWKV Runner https://github.com/josStorer/RWKV-Runner  to test this ， use
 ![image.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/b_6KCBdZKW7Q7HwipxE-l.png)
-67k input test
 ![image.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/F9unOJfhmJPXsciPHLsrl.png)
 ![image.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/35j5C1QD_4cO-AjfxV7tl.png)

 This model trained with instructions datasets and chinese web novel and tradition wuxia,
 more trainning details would be updated.
+Test input 85k tokens  to summary ,can find conversation files in example folders ,more cases are coming.
 Full finetuned using this repo to train 128k context model , 4*A800 40hours with 1.3B tokens.
 https://github.com/SynthiaDL/TrainChatGalRWKV/blob/main/train_world.sh
 ![image.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/b_6KCBdZKW7Q7HwipxE-l.png)
+85k input test
 ![image.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/F9unOJfhmJPXsciPHLsrl.png)
 ![image.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/35j5C1QD_4cO-AjfxV7tl.png)