---
base_model: llm-jp/llm-jp-3-13b
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
license: llama3.1
language:
- en
- ja
datasets:
- ichikara-instruction-003-001-1.json
---

# For the README of the Matsuo Lab final submission score, please see the link below (provided in case the URL was set incorrectly)
- ayayana/llm-jp-3-13b-dpo_ayana10
https://huggingface.co/ayayana/llm-jp-3-13b-dpo_ayana10/blob/main/README.md


# Uploaded model
- llm-jp-3-13b-ayanatest_lora: a LoRA model based on llm-jp/llm-jp-3-13b, trained with SFT on the ichikara-instruction-003-001-1 dataset

- **Developed by:** ayayana
- **License:** apache-2.0
- **Finetuned from model:** llm-jp/llm-jp-3-13b
- A model for practice and research purposes
- Commercial use is prohibited because of the dataset used

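- This model was used for exercises in the Matsuo Lab LLM course.
- Trained with LoRA_template_unsloth_20241127.ipynb, with the number of epochs adjusted to 3
- Training took about 30 minutes on an A100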
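
Since this card describes a LoRA adapter trained on top of llm-jp/llm-jp-3-13b, the following is a minimal sketch of how such an adapter might be loaded with PEFT and Transformers. It is not taken from the training notebook. The adapter repository id `ayayana/llm-jp-3-13b-ayanatest_lora` is an assumption inferred from the developer and model name above, the helper names `load_lora_model` and `generate` are hypothetical, and `device_map="auto"` assumes the `accelerate` package is installed.

```python
BASE_MODEL_ID = "llm-jp/llm-jp-3-13b"  # base model named in this card
# Assumption: adapter repo id inferred from the developer and model name above.
ADAPTER_ID = "ayayana/llm-jp-3-13b-ayanatest_lora"


def load_lora_model(base_model_id=BASE_MODEL_ID, adapter_id=ADAPTER_ID):
    """Load the base model and attach the LoRA adapter (hypothetical helper)."""
    # Heavy dependencies are imported lazily so this module stays cheap to import.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    base = AutoModelForCausalLM.from_pretrained(
        base_model_id,
        torch_dtype=torch.bfloat16,  # assumption: bf16 fits the target GPU
        device_map="auto",           # requires the accelerate package
    )
    model = PeftModel.from_pretrained(base, adapter_id)
    model.eval()
    return model, tokenizer


def generate(model, tokenizer, prompt, max_new_tokens=128):
    """Greedy-decode a completion for a prompt (hypothetical helper)."""
    import torch

    device = next(model.parameters()).device
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            input_ids=inputs["input_ids"].to(device),
            attention_mask=inputs["attention_mask"].to(device),
            max_new_tokens=max_new_tokens,
            do_sample=False,
        )
    # Drop the prompt tokens and decode only the newly generated part.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example usage (downloads the 13B base model, so it is left commented out):
# model, tokenizer = load_lora_model()
# print(generate(model, tokenizer, "Your prompt here"))
```

The prompt template used during SFT is not stated in this card, so the prompt string above is only a placeholder; outputs will likely be better when the prompt matches the format used in the training notebook.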


This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)