More of a proof of concept. Temper your expectations.

Warning: May generate 18+ content. Trained on the dialogue from Adastra (the furry visual novel) in text-generation-webui (by oobabooga), with a modified Training_PRO extension. Model was loaded unquantized (BF16). Currently, loading IA3's works in unmodified textgen-webui (I think... Load it as you would a LoRA) only if you load the model unquantized, and not in 4 or 8 bit.

Other Training parameters:

Add overlapping blocks: On

DEMENTOR (long form learning by FP): On //This might be the secret sauce that makes this IA3 so effective with just 1 epoch of training.

Extra:

Training took 9 minutes on an RTX 3090

If you are the creator of Adastra and would like this taken down, please contact me.

I do not claim to have produced the training data that went into this finetune.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.