Daemontatox commited on
Commit
c45d1ce
·
verified ·
1 Parent(s): 16abdfe

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +53 -11
README.md CHANGED
@@ -1,22 +1,64 @@
1
  ---
2
- base_model: unsloth/qwen2.5-14b-instruct-bnb-4bit
3
  tags:
4
- - text-generation-inference
5
  - transformers
6
- - unsloth
7
- - qwen2
8
- - trl
9
  license: apache-2.0
10
  language:
11
  - en
 
 
 
 
 
 
 
12
  ---
13
 
14
- # Uploaded model
15
 
16
- - **Developed by:** Daemontatox
17
- - **License:** apache-2.0
18
- - **Finetuned from model :** unsloth/qwen2.5-14b-instruct-bnb-4bit
19
 
20
- This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
21
 
22
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
1
  ---
 
2
  tags:
3
+ - long-cot-reasoning
4
  - transformers
5
+ - mamba2
6
+ - llms
7
+ - chain-of-thought
8
  license: apache-2.0
9
  language:
10
  - en
11
+ datasets:
12
+ - Daemontatox/LongCOT-Reason
13
+ - Daemontatox/alpaca_reasoning_COT
14
+ base_model:
15
+ - Qwen/Qwen2.5-14B-Instruct
16
+ pipeline_tag: text-generation
17
+ library_name: transformers
18
  ---
19
 
20
+ ![Sphinx of Reasoning](./Sphinx.jpg)
21
 
22
+ # **Sphinx: A Long Chain-of-Thought Reasoning Model**
 
 
23
 
24
+ - **Developed by:** Daemontatox
25
+ - **License:** Apache-2.0
26
+ - **Base Model:** Fine-tuned from `unsloth/qwen2.5-14b-instruct-bnb-4bit`
27
+ - **Accelerated by:** [Unsloth Framework](https://github.com/unslothai/unsloth)
28
+ - **TRL-Optimized:** Integrated with Huggingface's TRL library for enhanced performance.
29
+
30
+ ## **Overview**
31
+ Sphinx is a state-of-the-art Long Chain-of-Thought (CoT) reasoning model designed to address complex, multi-step reasoning tasks with precision and clarity. Built on the Qwen2.5 architecture, Sphinx excels in generating coherent, logical thought processes while maintaining high levels of interpretability and explainability.
32
+
33
+ > _"Decoding complexity into clarity."_
34
+
35
+ ### **Key Features**
36
+ - **Enhanced CoT Reasoning:** Fine-tuned for generating multi-step solutions with deep logical consistency.
37
+ - **Efficient Performance:** Powered by Unsloth, achieving 2x faster training without compromising accuracy.
38
+ - **4-bit Quantization:** Optimized for resource-constrained environments while maintaining robust performance.
39
+ - **Multi-Task Versatility:** Excels in diverse domains, including mathematical proofs, legal reasoning, and advanced scientific problem-solving.
40
+ - **TRL Integration:** Employs reinforcement learning to improve generation quality through continuous feedback loops.
41
+
42
+ ## **Model Details**
43
+ ### **Architecture**
44
+ - **Base Model:** Qwen2.5-14B
45
+ - **Parameters:** 14 billion
46
+ - **Quantization:** 4-bit precision using BitsAndBytes (bnb).
47
+ - **Token Window:** Supports long-form inputs with a context window of up to 16k tokens, ideal for extensive reasoning tasks.
48
+
49
+ ### **Training Details**
50
+ - **Frameworks:** Huggingface Transformers + TRL + Unsloth.
51
+ - **Data Sources:** Curated datasets emphasizing reasoning tasks, including academic, legal, and logical contexts.
52
+ - **Optimization:** LoRA for parameter-efficient fine-tuning; RLHF for enhanced response alignment.
53
+
54
+ ### **Capabilities**
55
+ 1. **Long-CoT Generation:** Capable of breaking down and solving complex, multi-layered problems.
56
+ 2. **Explainable AI (XAI):** Provides clear, step-by-step reasoning for outputs.
57
+ 3. **Customizability:** Easily adaptable to niche reasoning tasks via lightweight fine-tuning.
58
+
59
+ ## **Applications**
60
+ - **Academic Research:** Generating detailed, structured analyses for scientific problems.
61
+ - **Legal Assistance:** Drafting and explaining multi-step legal arguments.
62
+ - **STEM Education:** Guiding students through intricate mathematical and logical problems.
63
+ - **Cognitive AI Systems:** Seamless integration into systems requiring transparent decision-making.
64