LoneStriker/Yi-34B-200K-8.0bpw-h8-exl2

Nov 12, 2023

Sorry if this has been asked - do these converted models need the .py files from the original Yi models?

Owner Nov 12, 2023

It's a bit complicated. I believe you need at least the .py file for tokenization. Earlier fine-tunes of Yi 34B, like this one, used the original base models as released. Later fine-tunes are typically based on the Yi 34b-Llama model, which renames two layers from Yi back to what they were in standard Llama. Those later fine-tunes do not need trust remote code to run, but earlier fine-tunes do.
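One rough way to tell the two cases apart, as a sketch rather than a definitive check: Transformers records repo-shipped custom classes under an `auto_map` key in `config.json`, so its presence suggests trust remote code will be needed. The helper name `needs_trust_remote_code` and both sample configs below are made up for illustration and are not copied from any actual repository. The tokenizer can carry its own custom code separately, which this does not cover.

```python
def needs_trust_remote_code(config: dict) -> bool:
    """Guess whether a checkpoint relies on .py files shipped in its repo.

    Assumption: custom model classes are declared under the "auto_map"
    key of config.json, so a non-empty entry means remote code is needed.
    """
    return bool(config.get("auto_map"))


# Illustrative configs only (hypothetical, not taken from real repos).
original_style = {
    "model_type": "Yi",
    "auto_map": {"AutoModelForCausalLM": "modeling_yi.YiForCausalLM"},
}
llama_style = {
    "model_type": "llama",
    "architectures": ["LlamaForCausalLM"],
}

print(needs_trust_remote_code(original_style))
print(needs_trust_remote_code(llama_style))
```

In practice you would load the dict with `json.load` from the model folder's `config.json` before passing it in.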

bdambrosio

Nov 12, 2023 • edited Nov 13, 2023

Ah - I found this thread in the Yi GitHub: https://github.com/01-ai/Yi/issues/30
It doesn't really clarify things, but they claim <|Human|> etc. aren't used in the current model, although <|endoftext|> IS important. Confusing.

Update - switched to your spicyboros 8bpw model. Now trying the Llama-2 prompt, the same one I use with jondurbin's models. Still not having much luck. Example prompt:

\n[INST] <>\nyou are a chatty ai.\n<>\nhi. Can I call you Alice? \n[/INST] (note: what appears as <> is actually the correct SYS tag; it doesn't make it through markdown)
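Since markdown eats the tags, here is a minimal sketch of building that prompt in code, assuming the standard Llama-2 chat layout with `<<SYS>>` and `<</SYS>>` around the system message. The function name `build_llama2_prompt` is just for illustration, and whether this fine-tune actually expects this exact spacing and newlines is not confirmed anywhere in this thread. The BOS token is left out on the assumption that the tokenizer adds it.

```python
def build_llama2_prompt(system: str, user: str) -> str:
    """Format one system + user turn in the Llama-2 chat style.

    Assumption: standard Llama-2 layout; the BOS token is not included
    because the tokenizer is expected to prepend it.
    """
    return f"[INST] <<SYS>>\n{system}\n<</SYS>>\n\n{user} [/INST]"


prompt = build_llama2_prompt("you are a chatty ai.", "hi. Can I call you Alice?")
# repr() shows the newlines explicitly, which makes spacing easy to compare.
print(repr(prompt))
```

Printing with `repr` makes it easier to compare against what the loader actually receives.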


trust remote code?