Context Length
#2
by FineMist · opened
Such a good model, but I notice every Llama 3 model starts to develop Shakespearean language once it gets to a context length of 16k-32k. Is that unavoidable? My settings follow whatever the model suggests.
Unavoidable; they aren't trained to go past 8k context, after all.
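For anyone curious, here's a minimal sketch (assuming the Hugging Face transformers API; the exact `rope_scaling` schema varies by library version, and the model ID is just a placeholder, not this repo) of checking the trained context window and stretching RoPE at inference time. Scaling can soften the degradation past 8k, but it won't eliminate it:

```python
# Minimal sketch: inspect a Llama 3 model's trained context window and
# apply dynamic NTK RoPE scaling as a partial workaround. The model ID
# below is a placeholder; swap in the checkpoint you're actually using.
from transformers import AutoConfig, AutoModelForCausalLM

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # placeholder model ID

# Llama 3 ships with max_position_embeddings = 8192, i.e. an 8k window.
config = AutoConfig.from_pretrained(model_id)
print(config.max_position_embeddings)

# Dynamic NTK scaling stretches the rotary embeddings at inference time;
# quality still drifts past the trained window, just more gracefully.
# (Older transformers versions use "type"; newer ones use "rope_type".)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    rope_scaling={"type": "dynamic", "factor": 4.0},  # ~32k effective
)
```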
Bummer. Still, this model is amazing. I like it even more than Stheno.