Commit 51db3c9 by keitokei1994 (1 parent: 8a28601)
Update README.md
README.md CHANGED
@@ -16,14 +16,14 @@ tags:
 - **モデル名**: swallow-3-8B-sqlcoder-2x8B-GGUF
 - **モデルアーキテクチャ**: Mixture of Experts (MoE)
 - **ベースモデル**:
-- [
+- [aixsatoshi/Llama3-Swallow-8B-instruct-vector-merged](https://huggingface.co/aixsatoshi/Llama3-Swallow-8B-instruct-vector-merged)
 - [defog/llama-3-sqlcoder-8b](https://huggingface.co/defog/llama-3-sqlcoder-8b)
 - **マージツール**: MergeKit
 
-このMoEモデルは、
+このMoEモデルは、Llama3-Swallow-8B-instruct-vector-mergedの日本語能力とLlama-3-sqlcoder-8bのSQL生成能力を組み合わせることで、より強力で多機能な言語モデルを目指しています。
 #### 特徴
 - 日本語と英語の両方に対応
--
+- Llama3-Swallow-8B-instruct-vector-mergedによる優れた日本語処理能力
 - Llama-3-sqlcoder-8bによる高度なSQL生成と処理能力
 #### 要求スペック
 Q4_K_M量子化モデルであれば、RTX3060 12GBでフルロード可能です。
@@ -37,14 +37,14 @@ The gguf version can be found [こちら](https://huggingface.co/keitokei1994/sw
 - **Model Name**: swallow-3-8B-sqlcoder-2x8B-GGUF
 - **Model Architecture**: Mixture of Experts (MoE)
 - **Base Models**:
-- [
+- [aixsatoshi/Llama3-Swallow-8B-instruct-vector-merged](https://huggingface.co/aixsatoshi/Llama3-Swallow-8B-instruct-vector-merged)
 - [defog/llama-3-sqlcoder-8b](https://huggingface.co/defog/llama-3-sqlcoder-8b)
 - **Merge Tool**: MergeKit
 
-This MoE model aims to create a more powerful and versatile language model by combining the Japanese language capabilities of
+This MoE model aims to create a more powerful and versatile language model by combining the Japanese language capabilities of Llama3-Swallow-8B-instruct-vector-merged with the SQL generation abilities of Llama-3-sqlcoder-8b.
 #### Features
 - Support for both Japanese and English languages
-- Excellent Japanese processing capabilities from
+- Excellent Japanese processing capabilities from Llama3-Swallow-8B-instruct-vector-merged
 - Advanced SQL generation and processing capabilities from Llama-3-sqlcoder-8b
 #### System Requirements
 If using the Q4_K_M quantized model, it can be fully loaded on an RTX3060 12GB.
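
The claim that the Q4_K_M build fits entirely on an RTX3060 12GB can be sanity-checked with a rough back-of-the-envelope estimate. The sketch below is not taken from the README: the parameter count (about 13.7 billion for a 2x8B Llama-3 MoE that shares attention weights and duplicates the MLP blocks), the average of roughly 4.85 bits per weight for Q4_K_M, and the treatment of the card as 12 GiB are all assumptions, and the helper name `gguf_size_gib` is hypothetical. It also ignores the KV cache and runtime buffers, which consume part of the remaining headroom.

```python
def gguf_size_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB.

    Hypothetical helper: counts only the weights, not the KV cache,
    activations, or any runtime overhead.
    """
    return n_params * bits_per_weight / 8 / 2**30


# Assumptions (not stated in the README):
ASSUMED_PARAMS = 13.7e9  # rough parameter count for a 2x8B Llama-3 MoE
ASSUMED_BPW = 4.85       # rough average bits per weight for Q4_K_M
VRAM_GIB = 12.0          # RTX3060 12GB treated as 12 GiB

weights_gib = gguf_size_gib(ASSUMED_PARAMS, ASSUMED_BPW)
headroom_gib = VRAM_GIB - weights_gib

print(f"estimated weights: {weights_gib:.1f} GiB")
print(f"headroom left for KV cache and buffers: {headroom_gib:.1f} GiB")
```

Under these assumptions the weights come in below the 12 GiB budget, which is consistent with the README's statement; the actual file size listed in the model repository is the authoritative number.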