---
license: cc-by-4.0
datasets:
- Aiwensile2/Minecraft_QA-pairs_Instruction_Dataset
---
# MineMA: Fine-Tuned Models for Minecraft Q&A
## Overview
This repository presents the MineMA series of models, fine-tuned for Minecraft-related Q&A tasks. We used the LoRA method to adapt pre-trained LLaMA models so that they respond accurately to Minecraft-related instructions and queries. Fine-tuning used a Minecraft Q&A dataset generated specifically for this purpose.
## Models
The MineMA series includes several models fine-tuned on different base models from the LLaMA series. Below is the list of the fine-tuned models provided in this repository:
- **MineMA-8B** (v1, v2, v3, v4), derived from the base model LLaMA-3-8B-Instruct.
- **MineMA-13B** (v1, v2), derived from the base model LLaMA-2-13B-Chat.
- **MineMA-70B**, derived from the base model LLaMA-3-70B-Instruct.

All of these models were fine-tuned on the **Minecraft_QA-pairs_Instruction_Dataset**. Only the four MineMA-8B models have been released so far; the remaining models will be added in the future.
## Fine-Tuning Methodology
### LoRA Method for Fine-Tuning
We used the **LoRA (Low-Rank Adaptation)** method to fine-tune our models. LoRA is a parameter-efficient training technique that keeps the pre-trained weights frozen and trains small low-rank matrices added alongside them, so the network can be adapted without retraining the entire model. This keeps the number of trainable parameters, and therefore the compute and memory cost, well below that of full fine-tuning.
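To give a rough sense of the savings, the sketch below counts the trainable parameters LoRA adds to a single weight matrix at the ranks used for the MineMA-8B variants. The 4096 x 4096 matrix size and the helper `lora_param_count` are illustrative assumptions; which layers were actually adapted is not stated here.

```python
def lora_param_count(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters LoRA adds to one d_out x d_in weight matrix.

    LoRA learns two small matrices, B (d_out x r) and A (r x d_in), and
    applies the frozen weight W as W + (alpha / r) * B @ A.
    """
    return r * (d_in + d_out)


# Assumption: a square 4096 x 4096 projection, chosen for illustration only.
d = 4096
full = d * d  # parameters in the frozen matrix

for r in (32, 64, 128):  # ranks listed for MineMA-8B-v2, v1/v3 and v4
    added = lora_param_count(d, d, r)
    print(f"r={r}: {added} trainable vs {full} frozen ({added / full:.2%})")
```

Doubling the rank doubles the number of trainable parameters per adapted matrix, which is the main trade-off between the variants.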
### Training Parameters
The table below lists the training parameters for each model:
| Model | lora\_r | lora\_alpha | lora\_dropout | learning\_rate | weight\_decay | Single Round? |
|--------------|---------|-------------|---------------|----------------|---------------|---------------|
| MineMA-13B-v1| 64 | 128 | 0.1 | 1E-04 | 1E-04 | False |
| MineMA-13B-v2| 128 | 256 | 0.1 | 1E-04 | 1E-04 | False |
| MineMA-8B-v1 | 64 | 128 | 0.1 | 1E-04 | 1E-04 | True |
| MineMA-8B-v2 | 32 | 64 | 0.1 | 1E-04 | 1E-04 | False |
| MineMA-8B-v3 | 64 | 128 | 0.1 | 1E-04 | 1E-04 | False |
| MineMA-8B-v4 | 128 | 256 | 0.1 | 1E-04 | 1E-04 | False |
| MineMA-70B | 16 | 32 | 0.1 | 1E-04 | 1E-04 | True |
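
The parameters above can be captured in a small Python structure, for example to drive a training script. This is a minimal sketch, not the authors' training code: the `LoraSettings` class and the `scaling` property are hypothetical names, and the `lora_alpha / lora_r` scaling factor assumes the standard LoRA formulation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraSettings:
    """One row of the training-parameter table (hypothetical helper)."""
    lora_r: int
    lora_alpha: int
    lora_dropout: float = 0.1    # same for every model in the table
    learning_rate: float = 1e-4  # same for every model in the table
    weight_decay: float = 1e-4   # same for every model in the table

    @property
    def scaling(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / r (assumption).
        return self.lora_alpha / self.lora_r


SETTINGS = {
    "MineMA-13B-v1": LoraSettings(lora_r=64, lora_alpha=128),
    "MineMA-13B-v2": LoraSettings(lora_r=128, lora_alpha=256),
    "MineMA-8B-v1": LoraSettings(lora_r=64, lora_alpha=128),
    "MineMA-8B-v2": LoraSettings(lora_r=32, lora_alpha=64),
    "MineMA-8B-v3": LoraSettings(lora_r=64, lora_alpha=128),
    "MineMA-8B-v4": LoraSettings(lora_r=128, lora_alpha=256),
    "MineMA-70B": LoraSettings(lora_r=16, lora_alpha=32),
}

for name, s in SETTINGS.items():
    print(f"{name}: r={s.lora_r}, alpha={s.lora_alpha}, scaling={s.scaling}")
```

Every row keeps `lora_alpha` at twice `lora_r`, so under this assumption the variants differ in adapter capacity (rank) rather than in how strongly the update is scaled.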
## Dataset
We used the **Minecraft_QA-pairs_Instruction_Dataset** for fine-tuning all the models in the MineMA series. This dataset has 390,317 instruction entries specifically designed for Minecraft-related Q&A tasks. You can access the dataset via the following link:
[Minecraft_QA-pairs_Instruction_Dataset](https://huggingface.co/datasets/Aiwensile2/Minecraft_QA-pairs_Instruction_Dataset)
## Details
### License
These models are made available under the Creative Commons Attribution 4.0 International License.
### DOI
10.57967/hf/2488