REILX
/

Llama-3-8B-Instruct-Tiger-alpaca-chinese-lora

Model card Files Files and versions Community

基于alpaca-data-gpt4-chinese、sft_zh数据集对Llama-3-8B-Instruct进行微调。

模型：

https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct

数据集：

训练工具

https://github.com/hiyouga/LLaMA-Factory

测评方式：

使用opencompass(https://github.com/open-compass/OpenCompass/ )，测试工具基于CEval和MMLU对微调之后的模型和原始模型进行测试。
测试模型分别为：

Llama-3-8B
Llama-3-8B-Instruct
LLama3-Instruct-sft-lora-tigerbot-alpacadatagpt4,使用sft_zh、alpaca-data-gpt4-chinese数据对Llama-3-8B-Instruct使用sft方式lora微调

结果

模型名称	CEVAL	MMLU
LLama3	49.91	66.62
LLama3-Instruct	50.55	67.15
LLama3-Instruct-sft-lora-tigerbot-alpacadatagpt4	53.65	68.09

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Datasets used to train REILX/Llama-3-8B-Instruct-Tiger-alpaca-chinese-lora

Collection including REILX/Llama-3-8B-Instruct-Tiger-alpaca-chinese-lora

Llama3-SFT

A series of fine-tuned models based on the Llama model • 5 items • Updated Jul 9, 2024