Daemontatox committed on
Commit c03534a · verified · 1 Parent(s): d41acf4

Update README.md

Files changed (1)
  1. README.md +1 -107
README.md CHANGED
@@ -1,111 +1,6 @@
  ---
  base_model: Daemontatox/RA_Reasoner
- tags:
- - text-generation-inference
- - transformers
- - unsloth
- - llama
- - trl
- license: apache-2.0
- language:
- - en
- pipeline_tag: text-generation
- library_name: transformers
- model-index:
- - name: RA_Reasoner2.0
-   results:
-   - task:
-       type: text-generation
-       name: Text Generation
-     dataset:
-       name: IFEval (0-Shot)
-       type: HuggingFaceH4/ifeval
-       args:
-         num_few_shot: 0
-     metrics:
-     - type: inst_level_strict_acc and prompt_level_strict_acc
-       value: 55.92
-       name: strict accuracy
-     source:
-       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Daemontatox/RA_Reasoner
-       name: Open LLM Leaderboard
-   - task:
-       type: text-generation
-       name: Text Generation
-     dataset:
-       name: BBH (3-Shot)
-       type: BBH
-       args:
-         num_few_shot: 3
-     metrics:
-     - type: acc_norm
-       value: 43.07
-       name: normalized accuracy
-     source:
-       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Daemontatox/RA_Reasoner
-       name: Open LLM Leaderboard
-   - task:
-       type: text-generation
-       name: Text Generation
-     dataset:
-       name: MATH Lvl 5 (4-Shot)
-       type: hendrycks/competition_math
-       args:
-         num_few_shot: 4
-     metrics:
-     - type: exact_match
-       value: 20.09
-       name: exact match
-     source:
-       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Daemontatox/RA_Reasoner
-       name: Open LLM Leaderboard
-   - task:
-       type: text-generation
-       name: Text Generation
-     dataset:
-       name: GPQA (0-shot)
-       type: Idavidrein/gpqa
-       args:
-         num_few_shot: 0
-     metrics:
-     - type: acc_norm
-       value: 10.85
-       name: acc_norm
-     source:
-       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Daemontatox/RA_Reasoner
-       name: Open LLM Leaderboard
-   - task:
-       type: text-generation
-       name: Text Generation
-     dataset:
-       name: MuSR (0-shot)
-       type: TAUR-Lab/MuSR
-       args:
-         num_few_shot: 0
-     metrics:
-     - type: acc_norm
-       value: 7.51
-       name: acc_norm
-     source:
-       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Daemontatox/RA_Reasoner
-       name: Open LLM Leaderboard
-   - task:
-       type: text-generation
-       name: Text Generation
-     dataset:
-       name: MMLU-PRO (5-shot)
-       type: TIGER-Lab/MMLU-Pro
-       config: main
-       split: test
-       args:
-         num_few_shot: 5
-     metrics:
-     - type: acc
-       value: 36.67
-       name: accuracy
-     source:
-       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Daemontatox/RA_Reasoner
-       name: Open LLM Leaderboard
+
  ---
  ![RA_REASONER](./image.webp)
  # Uploaded Model
@@ -131,4 +26,3 @@ This model is intended for research and development purposes related to text gen
  **Focus on Reasoning:** The fine-tuning has been geared towards enhancing the model's ability to tackle reasoning challenges and logic-based tasks.
  
  
- ---