minpeter
/

Llama-3.2-1B-chatml-tool-v2

Text Generation

text-generation-inference

Model card Files Files and versions Community

minpeter commited on Feb 9

Commit

cf0b1b2

·

verified ·

1 Parent(s): e51a39e

Create README.md

Files changed (1) hide show

README.md +23 -0

README.md ADDED Viewed

	@@ -0,0 +1,23 @@

+---
+license: llama3.2
+datasets:
+  - teknium/OpenHermes-2.5
+  - NousResearch/hermes-function-calling-v1
+base_model:
+  - minpeter/QLoRA-Llama-3.2-1B-chatml-tool-v2
+  - minpeter/Llama-3.2-1B-AlternateTokenizer-chatml
+language:
+  - en
+pipeline_tag: text-generation
+library_name: transformers
+tags:
+  - axolotl
+  - merge
+---
+The only difference from Llama-3.2-1B-chatml-tool-v1 is that it uses AlternateTokenizer, which does not define tool-related tokens (<tools>, <tool_call>, <tool_response>).
+In the case of the existing tool-AlternateTokenizer, the <tool_call> tag was not properly generated before the function call, but in v2, it was observed that it performed well when trained with the general AlternateTokenizer.
+need to check whether this phenomenon is repeated in larger models (3B, 8B).