rio-codes
/

Mixtral_Rio_oasst2_v1

Text Generation

Generated from Trainer

Model card Files Files and versions Community

rio-codes commited on Feb 5

Commit

4acf107

•

1 Parent(s): 23a7414

Update README.md

Files changed (1) hide show

README.md +11 -1

README.md CHANGED Viewed

@@ -5,6 +5,7 @@ tags:
 - trl
 - sft
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-v0.1
 model-index:
 - name: Mixtral_Rio_oasst2_v1
@@ -27,7 +28,16 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations

 - trl
 - sft
 - generated_from_trainer
+- text-generation
 base_model: mistralai/Mixtral-8x7B-v0.1
 model-index:
 - name: Mixtral_Rio_oasst2_v1
 ## Model description
+This is a LoRA trained on OpenAssistant data.
+The settings for the base model should be:
+Model loader: Transformers
+Compute_dtype: bfloat16
+quant_type: nf4
+cpu: enabled
+load-in-4bit: enabled
+use_double_quant: enabled
+set GPU memory as high as possible unless running locally to give some space for your desktop environment
+tweak CPU usage until it loads successfully
 ## Intended uses & limitations