flyingfishinwater committed
Update README.md

README.md CHANGED
@@ -1,6 +1,3 @@
----
-license: apache-2.0
----
# Llama3 8B

Llama 3 is the latest and most advanced LLM, trained on over 15T tokens, which improves its comprehension and handling of complex language nuances. It features an extended context window of 8k tokens, allowing the model to access more information from lengthy passages for more informed decision-making.
@@ -178,6 +175,7 @@ OpenChat is an innovative library of open-source language models, fine-tuned wit
**Prompt Format:**

```
+{{system}}
GPT4 Correct User: {{prompt}}<|end_of_turn|>GPT4 Correct Assistant:
```
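As an illustration (not part of the commit), the template above can be filled in with plain string formatting before the text is handed to a GGUF runtime. The `{{system}}` and `{{prompt}}` placeholders are the ones shown in the diff; the helper function below is hypothetical.

```python
# Minimal sketch: fill the OpenChat "GPT4 Correct" template shown above.
# The placeholder names come from the README; the helper itself is illustrative.
def build_openchat_prompt(system: str, prompt: str) -> str:
    parts = []
    if system:  # the commit adds an optional leading {{system}} line
        parts.append(system)
    parts.append(
        f"GPT4 Correct User: {prompt}<|end_of_turn|>GPT4 Correct Assistant:"
    )
    return "\n".join(parts)

if __name__ == "__main__":
    print(build_openchat_prompt("You are a helpful assistant.", "Hello!"))
```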
@@ -395,15 +393,15 @@ Chinese Tiny LLM 2B 是首个以中文为中心的大型语言模型,主要在

---

-#
+# Qwen2 7B Chat

Qwen (通义千问) is the large language model and large multimodal model series of the Qwen Team, Alibaba Group. It supports both Chinese and English.

-**Model Intention:**
+**Model Intention:** Qwen2 is the new series that generally surpasses most open-source models and demonstrates competitiveness against proprietary models across a series of benchmarks targeting language understanding, language generation, multilingual capability, coding, mathematics, reasoning, etc.

-**Model URL:** [https://huggingface.co/flyingfishinwater/good_and_small_models/resolve/main/
+**Model URL:** [https://huggingface.co/flyingfishinwater/good_and_small_models/resolve/main/Qwen2-7B-Instruct-Q3_K_S.gguf?download=true](https://huggingface.co/flyingfishinwater/good_and_small_models/resolve/main/Qwen2-7B-Instruct-Q3_K_S.gguf?download=true)

-**Model Info URL:** [https://huggingface.co/Qwen/
+**Model Info URL:** [https://huggingface.co/Qwen/Qwen2-7B-Instruct-GGUF](https://huggingface.co/Qwen/Qwen2-7B-Instruct-GGUF)

**Model License:** [License Info](https://huggingface.co/Qwen/Qwen1.5-4B-Chat/raw/main/LICENSE)
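The **Model URL** above points at a single GGUF file inside the `flyingfishinwater/good_and_small_models` repository. As a hedged sketch (assuming the `huggingface_hub` Python package, which the commit itself does not mention), the same file can be fetched programmatically; the repo id and filename are taken from that URL.

```python
# Sketch: download the Qwen2 7B Instruct GGUF referenced by the Model URL above.
# Requires `pip install huggingface_hub`; repo_id/filename come from that URL.
from huggingface_hub import hf_hub_download

model_path = hf_hub_download(
    repo_id="flyingfishinwater/good_and_small_models",
    filename="Qwen2-7B-Instruct-Q3_K_S.gguf",
)
print(model_path)  # local cache path of the ~3.5 GB quantized model
```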
@@ -411,16 +409,59 @@ Qwen is the large language model and large multimodal model series of the Qwen T

**Developer:** [https://qwenlm.github.io/](https://qwenlm.github.io/)

-**File Size:**
+**File Size:** 3490 MB

-**Context Length:**
+**Context Length:** 2048 tokens

**Prompt Format:**

```
-<|im_start|>
-{{
-<|
+<|im_start|>system
+{{system_prompt}}<|im_end|>
+<|im_start|>
+{{prompt}}<|im_end|>
+<|im_start|>assistant
+
+```
+
+**Template Name:** chatml
+
+**Add BOS Token:** Yes
+
+**Add EOS Token:** No
+
+**Parse Special Tokens:** Yes
+
+---
+
+# Qwen2 1.5B Chat
+
+Qwen (通义千问) is the large language model and large multimodal model series of the Qwen Team, Alibaba Group. It supports both Chinese and English.
+
+**Model Intention:** Qwen2 is the new series that generally surpasses most open-source models and demonstrates competitiveness against proprietary models across a series of benchmarks targeting language understanding, language generation, multilingual capability, coding, mathematics, reasoning, etc.
+
+**Model URL:** [https://huggingface.co/flyingfishinwater/good_and_small_models/resolve/main/Qwen2-1.5B-Instruct.Q4_K_M.gguf?download=true](https://huggingface.co/flyingfishinwater/good_and_small_models/resolve/main/Qwen2-1.5B-Instruct.Q4_K_M.gguf?download=true)
+
+**Model Info URL:** [https://huggingface.co/Qwen/Qwen2-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2-1.5B-Instruct)
+
+**Model License:** [License Info](https://huggingface.co/Qwen/Qwen1.5-4B-Chat/raw/main/LICENSE)
+
+**Model Description:** Qwen (通义千问) is the large language model and large multimodal model series of the Qwen Team, Alibaba Group. It supports both Chinese and English.
+
+**Developer:** [https://qwenlm.github.io/](https://qwenlm.github.io/)
+
+**File Size:** 986 MB
+
+**Context Length:** 2048 tokens
+
+**Prompt Format:**
+
+```
+<|im_start|>system
+{{system_prompt}}<|im_end|>
+<|im_start|>
+{{prompt}}<|im_end|>
<|im_start|>assistant

```
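For readers who want to try the ChatML format above outside the app, here is a minimal sketch using llama-cpp-python (an assumption; the README only documents the template, not a runtime). It hand-builds one turn exactly as in the Prompt Format block; note that the README's second `<|im_start|>` marker carries no role name, whereas standard ChatML uses `user`, which is what the sketch assumes.

```python
# Sketch: one ChatML-formatted turn against a local Qwen2 GGUF file.
# Assumes `pip install llama-cpp-python` and a previously downloaded model;
# n_ctx matches the 2048-token Context Length listed above.
from llama_cpp import Llama

llm = Llama(model_path="Qwen2-1.5B-Instruct.Q4_K_M.gguf", n_ctx=2048)

prompt = (
    "<|im_start|>system\n"
    "You are a helpful assistant.<|im_end|>\n"
    "<|im_start|>user\n"  # README shows a bare <|im_start|>; 'user' is assumed
    "Say hello in one sentence.<|im_end|>\n"
    "<|im_start|>assistant\n"
)
out = llm(prompt, max_tokens=64, stop=["<|im_end|>"])
print(out["choices"][0]["text"])
```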
@@ -436,15 +477,15 @@ Qwen is the large language model and large multimodal model series of the Qwen T

---

-# Dolphin 2.
+# Dolphin 2.9.2 Qwen2 7B

This model is based on Mistral-7b-v0.2 with 16k context lengths. It's an uncensored model and supports a variety of instruction, conversational, and coding skills.

**Model Intention:** It's an uncensored, highly skilled English model, best for high-performance iPhone, iPad & Mac

-**Model URL:** [https://huggingface.co/flyingfishinwater/good_and_small_models/resolve/main/dolphin-2.
+**Model URL:** [https://huggingface.co/flyingfishinwater/good_and_small_models/resolve/main/dolphin-2.9.2-qwen2-7b-Q3_K_S.gguf?download=true](https://huggingface.co/flyingfishinwater/good_and_small_models/resolve/main/dolphin-2.9.2-qwen2-7b-Q3_K_S.gguf?download=true)

-**Model Info URL:** [https://huggingface.co/cognitivecomputations/dolphin-2.
+**Model Info URL:** [https://huggingface.co/cognitivecomputations/dolphin-2.9.2-qwen2-7b-gguf](https://huggingface.co/cognitivecomputations/dolphin-2.9.2-qwen2-7b-gguf)

**Model License:** [License Info](https://www.apache.org/licenses/LICENSE-2.0)
@@ -452,16 +493,17 @@ This model is based on Mistral-7b-v0.2 with 16k context lengths. It's a uncensor

**Developer:** [https://erichartford.com/](https://erichartford.com/)

-**File Size:**
+**File Size:** 3490 MB

-**Context Length:**
+**Context Length:** 2048 tokens

**Prompt Format:**

```
-
-{{
-<|
+<|im_start|>system
+{{system_prompt}}<|im_end|>
+<|im_start|>user
+{{prompt}}<|im_end|>
<|im_start|>assistant

```
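Because the Dolphin entry uses the same ChatML markers, a runtime that knows the `chatml` template can assemble the prompt itself. The following is a hedged sketch with llama-cpp-python's message-based API (again an assumption; the README does not name a runtime), using the GGUF filename and the 2048-token context length listed above.

```python
# Sketch: let llama-cpp-python apply the chatml template instead of building
# the <|im_start|> blocks by hand. Model filename and n_ctx come from the
# Dolphin entry above; the runtime choice itself is an assumption.
from llama_cpp import Llama

llm = Llama(
    model_path="dolphin-2.9.2-qwen2-7b-Q3_K_S.gguf",
    n_ctx=2048,            # Context Length listed above
    chat_format="chatml",  # matches the <|im_start|>/<|im_end|> prompt format
)

resp = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are Dolphin, a helpful assistant."},
        {"role": "user", "content": "Write a one-line hello world in Python."},
    ],
    max_tokens=128,
)
print(resp["choices"][0]["message"]["content"])
```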
@@ -511,4 +553,4 @@ ASSISTANT:

**Add EOS Token:** No

-**Parse Special Tokens:** Yes
+**Parse Special Tokens:** Yes
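The per-model switches in this README (Add BOS Token, Add EOS Token, Parse Special Tokens) map naturally onto tokenizer options. As a hedged illustration with llama-cpp-python's tokenizer (the flag names belong to that library, not to this README, and the model path is just an example):

```python
# Sketch: the template switches above expressed as tokenizer flags.
# Add BOS Token: Yes -> add_bos=True; Parse Special Tokens: Yes -> special=True;
# Add EOS Token: No simply means no EOS id is appended to the prompt.
from llama_cpp import Llama

llm = Llama(model_path="Qwen2-1.5B-Instruct.Q4_K_M.gguf", n_ctx=2048)

tokens = llm.tokenize(
    b"<|im_start|>user\nHello<|im_end|>\n<|im_start|>assistant\n",
    add_bos=True,   # Add BOS Token: Yes
    special=True,   # Parse Special Tokens: Yes (keep <|im_start|> etc. as tokens)
)
print(len(tokens), tokens[:8])
```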