adamo1139
/

BasicEconomics-SpicyBoros-2.2-7B-QLORA-v0.1

Model card Files Files and versions Community

adamo1139 commited on Sep 17, 2023

Commit

3d43edc

•

1 Parent(s): 44c1835

Update README.md

Files changed (1) hide show

README.md +7 -1

README.md CHANGED Viewed

@@ -14,4 +14,10 @@ The model can be used to ask questions about basic economic concepts, responses
 Prompt format:
 Reader: {prompt}
-'\nThomas:\n' {response}

 Prompt format:
 Reader: {prompt}
+'\nThomas:\n' {response}
+I was training on the sequence length of 1024, but I conversed with the model up to 4000 tokens and it was still coherent and in character.
+Even though the training date I used is only single turn, model has no issue with multi-turn conversations. Much of that is thanks to the fine-tuning done earlier by amazing Jon Durbin.
+Known issues:
+- tokenization didn't happen as I expected, so you can see a lot of /n, \' and ' characters in places where you shouldn't really see them. For example, most responses, if using the right prompt format, will have character ' at the end of response