grg's picture
Adding llama_3.2 and updating links
abb889e
raw
history blame contribute delete
863 Bytes
<p>
This open-source model was created by <a target="_blank" href="https://www.microsoft.com/">Microsoft<a>.
You can find the release blog post <a target="_blank" href="https://azure.microsoft.com/en-us/blog/introducing-phi-3-redefining-whats-possible-with-slms/">here</a>.
The model is available on the huggingface hub: <a target="_blank" href="https://huggingface.co/microsoft/Phi-3-mini-128k-instruct">https://huggingface.co/microsoft/Phi-3-mini-128k-instruct</a>.
The model has 16x3.8B parameters with 6.6B active parameters, and supports up to 128K token contexts.
Even though this model supports system messages, we evaluate this model as user-message-only model
(the persona is induced by sending the user message "You are &lt;persona&gt;" followed by a manually set "OK" as the assistant's response)
as it worked better.
</p>