hacendado committed on
Commit 8336ca5
1 Parent(s): b3ec055

update readme

Files changed (1)
  1. README.md +16 -1
README.md CHANGED
@@ -51,7 +51,22 @@ The model, while powerful, has limitations inherent to AI, including biases pres

 ### Training Data

- The dataset used was [instruct-legal-refugiados-es](https://huggingface.co/datasets/somosnlp/instruct-legal-refugiados-es) with [chatml gemma tokenizer](https://huggingface.co/philschmid/gemma-tokenizer-chatml)
+ The dataset used was [instruct-legal-refugiados-es](https://huggingface.co/datasets/somosnlp/instruct-legal-refugiados-es).
+ We wanted to build a conversational model, so we investigated the base model's prompt format and made it conversational based on the [chatml format](https://github.com/MicrosoftDocs/azure-docs/blob/main/articles/ai-services/openai/includes/chat-markup-language.md#working-with-chat-markup-language-chatml).
+
+ We identified the special tokens so the model could understand the different roles in the conversation.
+
+ Example:
+ ```
+ <bos><|im_start|>system
+ You are Gemma.<|im_end|>
+ <|im_start|>user
+ Hello, how are you?<|im_end|>
+ <|im_start|>assistant
+ I'm doing great. How can I help you today?<|im_end|>\n<eos>
+ ```
+
+ So we used [Phil Schmid's gemma chatml tokenizer](https://huggingface.co/philschmid/gemma-tokenizer-chatml) to adapt our dataset for training.
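
As a rough illustration of that adaptation step, a conversation can be rendered with the ChatML template shipped by that tokenizer as in the minimal sketch below; the messages are just the example conversation shown above, not rows from the dataset, and this is not the exact preprocessing code used for this model.

```
# Minimal sketch: render one conversation with the ChatML template from
# philschmid/gemma-tokenizer-chatml. The messages are illustrative
# placeholders taken from the example above, not dataset rows.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("philschmid/gemma-tokenizer-chatml")

messages = [
    {"role": "system", "content": "You are Gemma."},
    {"role": "user", "content": "Hello, how are you?"},
    {"role": "assistant", "content": "I'm doing great. How can I help you today?"},
]

# apply_chat_template wraps each turn in <|im_start|>role ... <|im_end|>,
# so every role is marked the way the example above shows.
text = tokenizer.apply_chat_template(messages, tokenize=False)
print(text)
```

The rendered string is then what each training example looks like after adapting the dataset.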

 ### Training Procedure
 The training was done on an RTX 4090 from Vast.ai using PEFT and LoRA
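
For context, a minimal single-GPU LoRA setup with the peft library looks roughly like the sketch below; the base checkpoint name and every hyperparameter are illustrative assumptions, not the values actually used for this model.

```
# Minimal sketch of a LoRA setup with peft on a single GPU.
# The base checkpoint and all hyperparameters below are assumptions.
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# The instruction dataset; each row would be formatted with the ChatML
# template shown earlier before training.
dataset = load_dataset("somosnlp/instruct-legal-refugiados-es", split="train")

model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-7b",           # assumption: base checkpoint
    torch_dtype=torch.bfloat16,  # keeps memory manageable on a 24 GB card
    device_map="auto",
)

# Only the small low-rank adapter matrices are trained; base weights stay frozen.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
# From here the model can be passed to a standard Trainer / SFT training loop.
```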