keitokei1994 committed on
Commit
51db3c9
1 Parent(s): 8a28601

Update README.md

Files changed (1)
  1. README.md +6 -6
README.md CHANGED
@@ -16,14 +16,14 @@ tags:
  - **モデル名**: swallow-3-8B-sqlcoder-2x8B-GGUF
  - **モデルアーキテクチャ**: Mixture of Experts (MoE)
  - **ベースモデル**:
- - [tokyotech-llm/Llama-3-Swallow-8B-Instruct-v0.1](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-8B-Instruct-v0.1)
+ - [aixsatoshi/Llama3-Swallow-8B-instruct-vector-merged](https://huggingface.co/aixsatoshi/Llama3-Swallow-8B-instruct-vector-merged)
  - [defog/llama-3-sqlcoder-8b](https://huggingface.co/defog/llama-3-sqlcoder-8b)
  - **マージツール**: MergeKit
 
- このMoEモデルは、Llama-3-Swallow-8B-Instruct-v0.1の日本語能力とLlama-3-sqlcoder-8bのSQL生成能力を組み合わせることで、より強力で多機能な言語モデルを目指しています。
+ このMoEモデルは、Llama3-Swallow-8B-instruct-vector-mergedの日本語能力とLlama-3-sqlcoder-8bのSQL生成能力を組み合わせることで、より強力で多機能な言語モデルを目指しています。
  #### 特徴
  - 日本語と英語の両方に対応
- - Llama-3-Swallow-8B-Instruct-v0.1による優れた日本語処理能力
+ - Llama3-Swallow-8B-instruct-vector-mergedによる優れた日本語処理能力
  - Llama-3-sqlcoder-8bによる高度なSQL生成と処理能力
  #### 要求スペック
  Q4_K_M量子化モデルであれば、RTX3060 12GBでフルロード可能です。
@@ -37,14 +37,14 @@ The gguf version can be found [こちら](https://huggingface.co/keitokei1994/sw
  - **Model Name**: swallow-3-8B-sqlcoder-2x8B-GGUF
  - **Model Architecture**: Mixture of Experts (MoE)
  - **Base Models**:
- - [tokyotech-llm/Llama-3-Swallow-8B-Instruct-v0.1](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-8B-Instruct-v0.1)
+ - [aixsatoshi/Llama3-Swallow-8B-instruct-vector-merged](https://huggingface.co/aixsatoshi/Llama3-Swallow-8B-instruct-vector-merged)
  - [defog/llama-3-sqlcoder-8b](https://huggingface.co/defog/llama-3-sqlcoder-8b)
  - **Merge Tool**: MergeKit
 
- This MoE model aims to create a more powerful and versatile language model by combining the Japanese language capabilities of Llama-3-Swallow-8B-Instruct-v0.1 with the SQL generation abilities of Llama-3-sqlcoder-8b.
+ This MoE model aims to create a more powerful and versatile language model by combining the Japanese language capabilities of Llama3-Swallow-8B-instruct-vector-merged with the SQL generation abilities of Llama-3-sqlcoder-8b.
  #### Features
  - Support for both Japanese and English languages
- - Excellent Japanese processing capabilities from Llama-3-Swallow-8B-Instruct-v0.1
+ - Excellent Japanese processing capabilities from Llama3-Swallow-8B-instruct-vector-merged
  - Advanced SQL generation and processing capabilities from Llama-3-sqlcoder-8b
  #### System Requirements
  If using the Q4_K_M quantized model, it can be fully loaded on an RTX3060 12GB.
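
The System Requirements line states that the Q4_K_M build fits on an RTX3060 12GB. A rough back-of-the-envelope check of that claim might look like the sketch below. The parameter count and the average bits per weight are assumptions for illustration, not figures from the README, and the estimate ignores the KV cache and runtime buffers, which also consume VRAM.

```python
def quantized_weight_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


# Assumed values (not stated in the README):
ASSUMED_PARAMS = 13.7e9          # rough total for a 2x8B MoE that shares attention layers
ASSUMED_BITS_PER_WEIGHT = 4.85   # rough average for a Q4_K_M quantization
VRAM_GIB = 12.0                  # RTX3060 12GB, taken from the README

weights_gib = quantized_weight_gib(ASSUMED_PARAMS, ASSUMED_BITS_PER_WEIGHT)
headroom_gib = VRAM_GIB - weights_gib  # what is left for KV cache and buffers

print(f"weights: {weights_gib:.1f} GiB, headroom: {headroom_gib:.1f} GiB")
```

Under these assumptions the weights leave a few GiB of headroom, which is consistent with the README's statement. Long context lengths would reduce that margin.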