mlabonne/phixtral-2x2_8 · Unable to replicate using LazyMergeKit Colab

sudhir2016

Jan 17, 2024

I tried to replicate using your LazyMergeKit notebook as a learning exercise. This yaml_config gives an error that positive_prompts are same for both experts.
base_model: cognitivecomputations/dolphin-2_6-phi-2
gate_mode: cheap_embed
experts:

source_model: cognitivecomputations/dolphin-2_6-phi-2
positive_prompts: [""]
source_model: lxuechen/phi-2-dpo
positive_prompts: [""]

Then I changed yaml_config like this
MODEL_NAME = "Phixtral-Merge"
yaml_config = """
base_model: cognitivecomputations/dolphin-2_6-phi-2
gate_mode: cheap_embed
experts:

source_model: cognitivecomputations/dolphin-2_6-phi-2
positive_prompts: ["code"]
source_model: lxuechen/phi-2-dpo
positive_prompts: ["math"]
"""
This time I got this error.
File "/content/mergekit/mergekit/io/lazy_tensor_loader.py", line 127, in get_tensor
raise KeyError(key)
KeyError: 'model.embed_tokens.weight'

Please help.

mlabonne

Owner Jan 18, 2024

Sorry that's normal, I modified mergekit's code to produce phixtral. This branch hasn't been released yet (still need to work on it).

sudhir2016

Jan 18, 2024

OK thanks.

sudhir2016 changed discussion status to closed Jan 18, 2024