Triangle104 commited on
Commit
4c89beb
1 Parent(s): eb118cb

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +159 -0
README.md ADDED
@@ -0,0 +1,159 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: other
5
+ library_name: transformers
6
+ tags:
7
+ - chat
8
+ - llama-cpp
9
+ - gguf-my-repo
10
+ license_name: mrl
11
+ pipeline_tag: text-generation
12
+ datasets:
13
+ - anthracite-org/c2_logs_32k_mistral-v3_v1.2_no_system
14
+ - anthracite-org/kalo-opus-instruct-22k-no-refusal-no-system
15
+ - anthracite-org/kalo-opus-instruct-3k-filtered-no-system
16
+ - anthracite-org/nopm_claude_writing_fixed
17
+ - anthracite-org/kalo_opus_misc_240827_no_system
18
+ - anthracite-org/kalo_misc_part2_no_system
19
+ base_model: anthracite-org/magnum-v4-22b
20
+ model-index:
21
+ - name: magnum-v4-22b
22
+ results:
23
+ - task:
24
+ type: text-generation
25
+ name: Text Generation
26
+ dataset:
27
+ name: IFEval (0-Shot)
28
+ type: HuggingFaceH4/ifeval
29
+ args:
30
+ num_few_shot: 0
31
+ metrics:
32
+ - type: inst_level_strict_acc and prompt_level_strict_acc
33
+ value: 56.29
34
+ name: strict accuracy
35
+ source:
36
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=anthracite-org/magnum-v4-22b
37
+ name: Open LLM Leaderboard
38
+ - task:
39
+ type: text-generation
40
+ name: Text Generation
41
+ dataset:
42
+ name: BBH (3-Shot)
43
+ type: BBH
44
+ args:
45
+ num_few_shot: 3
46
+ metrics:
47
+ - type: acc_norm
48
+ value: 35.55
49
+ name: normalized accuracy
50
+ source:
51
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=anthracite-org/magnum-v4-22b
52
+ name: Open LLM Leaderboard
53
+ - task:
54
+ type: text-generation
55
+ name: Text Generation
56
+ dataset:
57
+ name: MATH Lvl 5 (4-Shot)
58
+ type: hendrycks/competition_math
59
+ args:
60
+ num_few_shot: 4
61
+ metrics:
62
+ - type: exact_match
63
+ value: 17.6
64
+ name: exact match
65
+ source:
66
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=anthracite-org/magnum-v4-22b
67
+ name: Open LLM Leaderboard
68
+ - task:
69
+ type: text-generation
70
+ name: Text Generation
71
+ dataset:
72
+ name: GPQA (0-shot)
73
+ type: Idavidrein/gpqa
74
+ args:
75
+ num_few_shot: 0
76
+ metrics:
77
+ - type: acc_norm
78
+ value: 10.4
79
+ name: acc_norm
80
+ source:
81
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=anthracite-org/magnum-v4-22b
82
+ name: Open LLM Leaderboard
83
+ - task:
84
+ type: text-generation
85
+ name: Text Generation
86
+ dataset:
87
+ name: MuSR (0-shot)
88
+ type: TAUR-Lab/MuSR
89
+ args:
90
+ num_few_shot: 0
91
+ metrics:
92
+ - type: acc_norm
93
+ value: 13.43
94
+ name: acc_norm
95
+ source:
96
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=anthracite-org/magnum-v4-22b
97
+ name: Open LLM Leaderboard
98
+ - task:
99
+ type: text-generation
100
+ name: Text Generation
101
+ dataset:
102
+ name: MMLU-PRO (5-shot)
103
+ type: TIGER-Lab/MMLU-Pro
104
+ config: main
105
+ split: test
106
+ args:
107
+ num_few_shot: 5
108
+ metrics:
109
+ - type: acc
110
+ value: 31.44
111
+ name: accuracy
112
+ source:
113
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=anthracite-org/magnum-v4-22b
114
+ name: Open LLM Leaderboard
115
+ ---
116
+
117
+ # Triangle104/magnum-v4-22b-Q4_K_M-GGUF
118
+ This model was converted to GGUF format from [`anthracite-org/magnum-v4-22b`](https://huggingface.co/anthracite-org/magnum-v4-22b) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
119
+ Refer to the [original model card](https://huggingface.co/anthracite-org/magnum-v4-22b) for more details on the model.
120
+
121
+ ## Use with llama.cpp
122
+ Install llama.cpp through brew (works on Mac and Linux)
123
+
124
+ ```bash
125
+ brew install llama.cpp
126
+
127
+ ```
128
+ Invoke the llama.cpp server or the CLI.
129
+
130
+ ### CLI:
131
+ ```bash
132
+ llama-cli --hf-repo Triangle104/magnum-v4-22b-Q4_K_M-GGUF --hf-file magnum-v4-22b-q4_k_m.gguf -p "The meaning to life and the universe is"
133
+ ```
134
+
135
+ ### Server:
136
+ ```bash
137
+ llama-server --hf-repo Triangle104/magnum-v4-22b-Q4_K_M-GGUF --hf-file magnum-v4-22b-q4_k_m.gguf -c 2048
138
+ ```
139
+
140
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
141
+
142
+ Step 1: Clone llama.cpp from GitHub.
143
+ ```
144
+ git clone https://github.com/ggerganov/llama.cpp
145
+ ```
146
+
147
+ Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
148
+ ```
149
+ cd llama.cpp && LLAMA_CURL=1 make
150
+ ```
151
+
152
+ Step 3: Run inference through the main binary.
153
+ ```
154
+ ./llama-cli --hf-repo Triangle104/magnum-v4-22b-Q4_K_M-GGUF --hf-file magnum-v4-22b-q4_k_m.gguf -p "The meaning to life and the universe is"
155
+ ```
156
+ or
157
+ ```
158
+ ./llama-server --hf-repo Triangle104/magnum-v4-22b-Q4_K_M-GGUF --hf-file magnum-v4-22b-q4_k_m.gguf -c 2048
159
+ ```