infering by multi-model session but get wrong output

by enlei - opened 8 days ago

Discussion

enlei

8 days ago

loading ChatML-preset qwen2 in mutlti-model session，get wrong putput by inputing "what can you do?"

as shown in the image below

SFoNX

8 days ago

When you use this model in LM Studio - you need to use the included ChatML preset.
Then in Settings (Right hand side chat screen) Go to -> Model Initialization -> Flash Attention -> Turn it on

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment