File size: 1,975 Bytes
72f81f5
 
 
0e173ca
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
---
license: apache-2.0
---
---
license: apache-2.0
---
# Model url: https://huggingface.co/TimeMobius/Mobius-RWKV-r5-chat-12B-8k
Considering the long context required for training from scratch, we decided to retrain the r5 12B model from 8k.
This model exhibits lower diversity compared to its predecessor, but it excels in following instructions and  logical understanding. It is possible to utilize both models simultaneously as multi-agents, each performing a different task.

# Mobius RWKV r5 chat 12B 8k
Mobius is a RWKV v5.2 arch chat model, benifit from [Matrix-Valued States and Dynamic Recurrence](https://arxiv.org/abs/2404.05892)

## Introduction

Mobius is a RWKV v5.2 arch model, a state based RNN+CNN+Transformer Mixed language model pretrained on a certain amount of data.
In comparison with the previous released Mobius, the improvements include:

* Only 24G Vram to run this model locally with fp16;
* Significant performance improvement;
* Multilingual support ;
* Stable support of 128K context length.
* Base model [Mobius-mega-12B-128k-base](https://huggingface.co/TimeMobius/Moibus-mega-12B-128k-base)
  

## Usage
We encourage you use few shots to use this model, Desipte Directly use User: xxxx\n\nAssistant: xxx\n\n is really good too, Can boost all potential ability. 

Recommend Temp and topp: 0.7 0.6/1 0.3/1.5 0.3/0.2 0.8

## More details
Mobius 12B 128k based on RWKV v5.2 arch, which is leading state based RNN+CNN+Transformer Mixed large language model which focus opensouce community
* 10~100 trainning/inference cost reduce;
* state based,selected memory, which mean good at grok;
* community support.

## requirements
24G vram to run fp16, 12G for int8, 6G for nf4 with Ai00 server.

* [RWKV Runner](https://github.com/josStorer/RWKV-Runner)
* [Ai00 server](https://github.com/cgisky1980/ai00_rwkv_server)

## future plan
If you need a HF version let us know

[Mobius-Chat-12B-128k](https://huggingface.co/TimeMobius/Mobius-Chat-12B-128k)