zhiminy commited on
Commit
ab10f2f
·
1 Parent(s): 0caf7c8

add mistral models

Browse files
Files changed (2) hide show
  1. app.py +1 -1
  2. context_window.json +3 -0
app.py CHANGED
@@ -519,7 +519,7 @@ with gr.Blocks() as app:
519
  # ⚔️ Software Engineering (SE) Arena: Explore and Test the Best SE Chatbots with Long-Context Interactions
520
 
521
  ## 📜How It Works
522
- - **Blind Comparison**: Submit a SE-related query to two anonymous chatbots randomly selected from up to {len(available_models)} top models, including OpenAI-o3, Gemini-2.0, Claude-3.5, Deepseek-r1, Llama-3.3, Qwen-2.5, and others.
523
  - **Interactive Voting**: Engage in multi-turn dialogues with both chatbots and compare their responses. You can continue the conversation until you confidently choose the better model.
524
  - **Fair Play Rules**: Votes are counted only if chatbot identities remain anonymous. Revealing a chatbot's identity disqualifies the session.
525
 
 
519
  # ⚔️ Software Engineering (SE) Arena: Explore and Test the Best SE Chatbots with Long-Context Interactions
520
 
521
  ## 📜How It Works
522
+ - **Blind Comparison**: Submit a SE-related query to two anonymous chatbots randomly selected from up to {len(available_models)} top models, including OpenAI-o3, Gemini-2.0, Claude-3.5, Deepseek-r1, Mistral-large, Llama-3.3, Qwen-2.5, and others.
523
  - **Interactive Voting**: Engage in multi-turn dialogues with both chatbots and compare their responses. You can continue the conversation until you confidently choose the better model.
524
  - **Fair Play Rules**: Votes are counted only if chatbot identities remain anonymous. Revealing a chatbot's identity disqualifies the session.
525
 
context_window.json CHANGED
@@ -16,6 +16,9 @@
16
  "llama-3.1-405b": 128000,
17
  "llama-3.1-70b": 128000,
18
  "llama-3.3-70b": 128000,
 
 
 
19
  "o1": 128000,
20
  "o1-mini": 128000,
21
  "o3-mini": 200000,
 
16
  "llama-3.1-405b": 128000,
17
  "llama-3.1-70b": 128000,
18
  "llama-3.3-70b": 128000,
19
+ "mistral-embed": 8000,
20
+ "mistral-large-latest": 131000,
21
+ "mistral-small-latest": 32000,
22
  "o1": 128000,
23
  "o1-mini": 128000,
24
  "o3-mini": 200000,