MaziyarPanahi
/

calme-2.3-qwen2-72b

+---
+language:
+- en
+license: other
+library_name: transformers
+tags:
+- chat
+- qwen
+- qwen2
+- calme
+- calme2
+- finetune
+- chatml
+base_model: Qwen/Qwen2-72B
+license_name: tongyi-qianwen
+license_link: https://huggingface.co/Qwen/Qwen2-72B/blob/main/LICENSE
+pipeline_tag: text-generation
+inference: false
+model_creator: MaziyarPanahi
+quantized_by: MaziyarPanahi
+---
+<img src="./calme-2.webp" alt="Qwen2 fine-tune" width="800" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
+# MaziyarPanahi/calme-2.3-qwen2-72b
+This model is a fine-tuned version of the powerful `Qwen/Qwen2-72B-Instruct`, pushing the boundaries of natural language understanding and generation even further. My goal was to create a versatile and robust model that excels across a wide range of benchmarks and real-world applications.
+## Use Cases
+This model is suitable for a wide range of applications, including but not limited to:
+- Advanced question-answering systems
+- Intelligent chatbots and virtual assistants
+- Content generation and summarization
+- Code generation and analysis
+- Complex problem-solving and decision support
+# ⚡ Quantized GGUF
+All GGUF models are available here: [MaziyarPanahi/calme-2.3-qwen2-72b-GGUF](https://huggingface.co/MaziyarPanahi/calme-2.3-qwen2-72b-GGUF)
+# 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Leaderboard 2: coming soon!
+|    Tasks     |Version|Filter|n-shot|Metric|Value |   |Stderr|
+|--------------|------:|------|-----:|------|-----:|---|-----:|
+|truthfulqa_mc2|      2|none  |     0|acc   |0.6761|±  |0.0148|
+|  Tasks   |Version|Filter|n-shot|Metric|Value |   |Stderr|
+|----------|------:|------|-----:|------|-----:|---|-----:|
+|winogrande|      1|none  |     5|acc   |0.8248|±  |0.0107|
+|    Tasks    |Version|Filter|n-shot| Metric |Value |   |Stderr|
+|-------------|------:|------|-----:|--------|-----:|---|-----:|
+|arc_challenge|      1|none  |    25|acc     |0.6852|±  |0.0136|
+|             |       |none  |    25|acc_norm|0.7184|±  |0.0131|
+|Tasks|Version|     Filter     |n-shot|  Metric   |Value |   |Stderr|
+|-----|------:|----------------|-----:|-----------|-----:|---|-----:|
+|gsm8k|      3|strict-match    |     5|exact_match|0.8582|±  |0.0096|
+|     |       |flexible-extract|     5|exact_match|0.8893|±  |0.0086|
+# Prompt Template
+This model uses `ChatML` prompt template:
+```
+<|im_start|>system
+{System}
+<|im_end|>
+<|im_start|>user
+{User}
+<|im_end|>
+<|im_start|>assistant
+{Assistant}
+````
+# How to use
+```python
+# Use a pipeline as a high-level helper
+from transformers import pipeline
+messages = [
+    {"role": "user", "content": "Who are you?"},
+]
+pipe = pipeline("text-generation", model="MaziyarPanahi/calme-2.3-qwen2-72b")
+pipe(messages)
+# Load model directly
+from transformers import AutoTokenizer, AutoModelForCausalLM
+tokenizer = AutoTokenizer.from_pretrained("MaziyarPanahi/calme-2.3-qwen2-72b")
+model = AutoModelForCausalLM.from_pretrained("MaziyarPanahi/calme-2.3-qwen2-72b")
+```
+# Ethical Considerations
+As with any large language model, users should be aware of potential biases and limitations. We recommend implementing appropriate safeguards and human oversight when deploying this model in production environments.