merge

This model is a merge of pre-trained language models created using mergekit. It combines lmsys/vicuna-7b-v1.5 and meta-math/MetaMath-Llemma-7B using the SLERP (spherical linear interpolation) merge method.
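To illustrate what a SLERP merge does to a pair of weight tensors, here is a minimal NumPy sketch. It is not mergekit's implementation: the function name `slerp`, the interpolation factor `t`, the `eps` guard, and the colinearity threshold are all assumptions made for this example, and the actual settings used for this model are not stated in the card.

```python
import numpy as np

def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between two weight tensors of the same shape.

    t = 0 returns v0, t = 1 returns v1 (hypothetical helper, for illustration only).
    """
    v0 = np.asarray(v0, dtype=np.float64)
    v1 = np.asarray(v1, dtype=np.float64)

    # Measure the angle between the two tensors using normalized, flattened copies.
    u0 = v0.ravel() / (np.linalg.norm(v0) + eps)
    u1 = v1.ravel() / (np.linalg.norm(v1) + eps)
    dot = np.clip(np.dot(u0, u1), -1.0, 1.0)

    # Nearly colinear tensors: fall back to plain linear interpolation
    # to avoid dividing by a vanishing sin(theta).
    if abs(dot) > 1.0 - 1e-6:
        return (1.0 - t) * v0 + t * v1

    theta = np.arccos(dot)
    sin_theta = np.sin(theta)
    w0 = np.sin((1.0 - t) * theta) / sin_theta
    w1 = np.sin(t * theta) / sin_theta
    return w0 * v0 + w1 * v1

# Toy usage on two small stand-in "weight" vectors.
a = np.array([1.0, 0.0])
b = np.array([0.0, 1.0])
merged = slerp(0.5, a, b)
print(merged)
```

In a real merge, an operation like this would be applied tensor by tensor across the two checkpoints, which requires both models to share the same architecture and parameter shapes.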

  1. Vicuna

    Model Details

    Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT.

    • Developed by: LMSYS
    • Model type: An auto-regressive language model based on the transformer architecture
    • License: Llama 2 Community License Agreement
    • Finetuned from model: Llama 2

  2. MetaMath Llemma

    Model Details

    MetaMath-Llemma-7B is fully fine-tuned on the MetaMathQA datasets and is based on the Llemma-7B model. Using the MetaMathQA datasets and changing the base model from llama-2-7B to Llemma-7B boosts MATH performance from 19.8 to 30.0.

Safetensors

    • Model size: 6.74B params
    • Tensor type: FP16

Model tree for taesunwhang/vicuna-metamath-llemma-7b