manu committed on
Commit
b075b60
•
1 Parent(s): 40f8c3b

Update app.py

Files changed (1)
  1. app.py +14 -8
app.py CHANGED
@@ -172,18 +172,24 @@ with gr.Blocks(analytics_enabled=False, css=custom_css) as demo:
  """
  ## Demo platform for 🥐 CroissantLLMChat
 
- The model is of small size (1.3B), about 130 times smaller than GPT3.
- As such, it's generalist Chat version logically exhibits reduced understanding, reasoning and knowledge capacities.
- For industrial uses, we recommend finetuning the model, but trained this Chat version to allow for experimenting and to showcase the capabilities for it's size.
 
- ## Usage recommendations
+ ### Usage recommendations
+
  We recommend testing the chat model for open-ended writing tasks, tips, translations, etc...
  We find direct instructions to work best, and performance to drop after the first round of interactions.
- We limit the length of the conversation so clear the Chat between tests !
+ We limit the length of each message to 256 tokens by default (can be changed in the settings below), and of the entire conversation so clear the Chat between tests !
+
+ ### Errors
 
- ## Errors
- The demo is linked to an endpoint that auto-shutdowns after 15mn. If error message appears, wait about 5 minutes and test again once the server is back up !
- The model can hallucinate and generate incorrect or even toxic content.
+ The model is very small in size (1.3B), about 130 times smaller than GPT3. As such, it's generalist Chat version logically exhibits reduced understanding, reasoning and knowledge capacities, and may still exhibit undesired behavior such as hallucinations, or toxicity (rarely)...
+ For industrial applications, we recommend finetuning the model, but trained this Chat version to allow for experimenting and to showcase the capabilities for it's size.
+
+ ### More info
+ 🗞️ The blogpost: https://huggingface.co/blog/manu/croissant-llm-blog
+ 📖 The 45 page report with lots of gems: https://arxiv.org/abs/2402.00786
+ 🤖 Models, Data, Demo: https://huggingface.co/croissantllm
+ ###
+
  """
  )
 