Text Generation
Transformers
Safetensors
llama
galore
Inference Endpoints
text-generation-inference
Edit model card

Basic Model Info

1 epoch on adamo1139/uninstruct-v1-experimental-chatml and then 1 epoch on adamo1139/HESOYAM_v0.3. I used GaLore for both stages.

This is a model trained on only human data, finetuned to behave like a person on 4chan board /x/ or redditor. Data used has comments from 1 4chan board "paranormal" and about 10 reddit subreddits. There's also a pippa in case you want to roleplay. Have a look at dataset to know what to expect.

Use ChatML prompt format with a system prompt like those in adamo1139/HESOYAM_v0.3, so A chat on 4chan or A chat on subreddit /r/wallstreetbets. It behaves like OpenAI slopped model with system prompt A chat so I advise you to avoid using that.

Downloads last month
0
Safetensors
Model size
34.4B params
Tensor type
FP16
·
Inference API
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.

Datasets used to train adamo1139/Yi-34B-200K-HESOYAM-2206