Effects of training data
#1 by ChuckMcSneed
This model has a tendency to emit stray markers like <|prompt|>, which were not present in base Llama. Its output is also a bit drier than the base model's. It seems that training with YaRN affects output quality in ways that were not captured by the benchmarks.
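As a stopgap, the stray marker can be banned at generation time. Below is a minimal sketch assuming the model is loaded through Hugging Face transformers and using its bad_words_ids generation argument; the model name is a placeholder, not necessarily the exact checkpoint discussed here.

```python
# Sketch: suppress a stray marker like <|prompt|> during generation.
# Assumes the Hugging Face transformers library; model name is a placeholder.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "NousResearch/Yarn-Llama-2-7b-64k"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Tokenize the unwanted marker (no special tokens added) and ban its
# token sequence via bad_words_ids, which expects a list of id lists.
bad_ids = tokenizer(["<|prompt|>"], add_special_tokens=False).input_ids

inputs = tokenizer("Once upon a time", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=128, bad_words_ids=bad_ids)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```

This only masks the symptom, of course; it does not address the underlying drift from the YaRN fine-tuning.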