<0x0A>

#7
by Omidh - opened

Not sure what I am doing wrong, but while running the model with Ollama I get a lot of <0x0A>'s in the output.
Would appreciate any hint.

Right now I replace <0x0A> with \n in my code, but is this a config error or an inconsistency in the training data?
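For reference, a minimal sketch of the string replacement described above, assuming the raw Ollama response is already available as a Python string; the function name is illustrative, not part of any library:

```python
def clean_output(text: str) -> str:
    # Workaround: replace the literal "<0x0A>" byte-token text
    # that leaks into the output with an actual newline.
    return text.replace("<0x0A>", "\n")

print(clean_output("Hello<0x0A>World"))  # -> "Hello\nWorld"
```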

Take a look at the vocabulary in the tokenizer. That hex code is one of the byte tokens near the start of the model vocabulary:

https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0/blob/main/tokenizer.json

So maybe your output is biased toward returning that token ID due to a coding error or some other issue. You can check what the token maps to, as in the sketch below.
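A minimal sketch of that check, assuming the `transformers` library is installed; it looks up "<0x0A>" in the TinyLlama tokenizer and confirms it decodes to a newline (0x0A is the byte value of \n):

```python
from transformers import AutoTokenizer

# Load the tokenizer from the repo linked above.
tok = AutoTokenizer.from_pretrained("TinyLlama/TinyLlama-1.1B-Chat-v1.0")

# "<0x0A>" is a byte-fallback token in Llama-style vocabularies.
token_id = tok.convert_tokens_to_ids("<0x0A>")
print("<0x0A> token id:", token_id)

# Decoding that single ID should yield a plain newline character.
print(repr(tok.decode([token_id])))
```

If decoding the ID gives a plain "\n", the token itself is fine, and the literal "<0x0A>" text appearing in the output points to how the tokens are being detokenized or rendered on the serving side rather than to the training data.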
