QuietImpostor/OpenELM-270M-Instruct-SonnOpus

Text Generation


QuietImpostor committed on Jul 9

Commit 7dce2a2 • 1 Parent(s): 4dedaba

Update README.md

Files changed (1)

README.md +1 -1


@@ -41,7 +41,7 @@ The model was fine-tuned on a synthetic dataset derived from GPT-4 (for user que
 ### Performance Metrics
 - **Training Loss:** Final loss of 1.3721 after 3 epochs
-- **Real-world Use** Seems to struggle with maintaining conversational context.
+- **Real-world Use** Seems to struggle with maintaining conversational context on CUDA? CPU produces much more coherent results compared to CUDA.
 ### Limitations and Current Shortcomings
 - The model's knowledge is limited to its training data and cut-off date.
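
The CPU-versus-CUDA observation in this change is tentative (note the question mark), so it may help to quantify it. One possible approach, not taken from the commit itself, is to run the same prompt with greedy decoding on each device and compare the resulting token IDs. The sketch below covers only the comparison step; `first_divergence`, `agreement_ratio`, and the sample token IDs are hypothetical names and values introduced for illustration.

```python
from typing import Optional, Sequence


def first_divergence(a: Sequence[int], b: Sequence[int]) -> Optional[int]:
    """Return the first index where two token-ID sequences differ, or None if identical."""
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    # One sequence is a strict prefix of the other: they diverge where the shorter ends.
    if len(a) != len(b):
        return min(len(a), len(b))
    return None


def agreement_ratio(a: Sequence[int], b: Sequence[int]) -> float:
    """Fraction of positions (measured over the longer sequence) where both agree."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / longest


if __name__ == "__main__":
    # Hypothetical token IDs standing in for greedy generations from each device.
    cpu_ids = [1, 450, 4996, 17354, 1701]
    cuda_ids = [1, 450, 4996, 9881, 3002]
    print(first_divergence(cpu_ids, cuda_ids))  # 3
    print(agreement_ratio(cpu_ids, cuda_ids))   # 0.6
```

An early divergence index with otherwise identical settings would point toward a device- or dtype-related numeric difference rather than sampling noise, though it would not by itself identify the cause.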