add mistral models
- app.py +1 -1
- context_window.json +3 -0
app.py
CHANGED
@@ -519,7 +519,7 @@ with gr.Blocks() as app:
 # ⚔️ Software Engineering (SE) Arena: Explore and Test the Best SE Chatbots with Long-Context Interactions
 
 ## 📜How It Works
-- **Blind Comparison**: Submit a SE-related query to two anonymous chatbots randomly selected from up to {len(available_models)} top models, including OpenAI-o3, Gemini-2.0, Claude-3.5, Deepseek-r1, Llama-3.3, Qwen-2.5, and others.
+- **Blind Comparison**: Submit a SE-related query to two anonymous chatbots randomly selected from up to {len(available_models)} top models, including OpenAI-o3, Gemini-2.0, Claude-3.5, Deepseek-r1, Mistral-large, Llama-3.3, Qwen-2.5, and others.
 - **Interactive Voting**: Engage in multi-turn dialogues with both chatbots and compare their responses. You can continue the conversation until you confidently choose the better model.
 - **Fair Play Rules**: Votes are counted only if chatbot identities remain anonymous. Revealing a chatbot's identity disqualifies the session.
 
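The "Blind Comparison" text in this hunk describes picking two anonymous chatbots at random and interpolating `{len(available_models)}` into the page copy. A minimal sketch of that idea follows. The model list and the helper name `pick_anonymous_pair` are hypothetical and chosen for illustration; the actual selection logic in app.py is not shown in this diff and may differ.

```python
import random

# Hypothetical model list; the real app builds available_models elsewhere.
available_models = ["o3-mini", "llama-3.3-70b", "mistral-large-latest", "o1"]


def pick_anonymous_pair(models, rng=random):
    """Return two distinct models chosen uniformly at random."""
    if len(models) < 2:
        raise ValueError("need at least two models to compare")
    # random.sample draws without replacement, so the pair is always distinct.
    left, right = rng.sample(models, 2)
    return left, right


# The page copy interpolates the model count the same way the hunk does.
intro = f"randomly selected from up to {len(available_models)} top models"

left_model, right_model = pick_anonymous_pair(available_models)
```

Sampling without replacement keeps a model from being compared against itself, which would make the vote meaningless.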
context_window.json
CHANGED
@@ -16,6 +16,9 @@
 "llama-3.1-405b": 128000,
 "llama-3.1-70b": 128000,
 "llama-3.3-70b": 128000,
+"mistral-embed": 8000,
+"mistral-large-latest": 131000,
+"mistral-small-latest": 32000,
 "o1": 128000,
 "o1-mini": 128000,
 "o3-mini": 200000,
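For orientation, here is a rough sketch of how a model-to-token-limit map like context_window.json might be consumed to keep a multi-turn conversation within a model's limit. The helper names (`fits_context`, `trim_history`) and the whitespace-based token count are assumptions made for illustration, not code from this repository. The entries are copied from the hunk above, and the real file contains more.

```python
import json

# A few entries copied from the hunk above; the real file has more.
CONTEXT_WINDOW_JSON = """{
  "llama-3.3-70b": 128000,
  "mistral-embed": 8000,
  "mistral-large-latest": 131000,
  "mistral-small-latest": 32000,
  "o3-mini": 200000
}"""

context_window = json.loads(CONTEXT_WINDOW_JSON)


def fits_context(model, token_count, windows=context_window):
    """Return True if token_count is within the model's context window."""
    if model not in windows:
        raise KeyError(f"no context window configured for {model!r}")
    return token_count <= windows[model]


def trim_history(messages, model, windows=context_window,
                 count_tokens=lambda text: len(text.split())):
    """Keep the most recent messages whose combined size fits the window.

    count_tokens is a crude whitespace approximation (an assumption);
    a real tokenizer would give different counts.
    """
    limit = windows[model]
    kept, total = [], 0
    for message in reversed(messages):  # walk from newest to oldest
        size = count_tokens(message)
        if total + size > limit:
            break
        kept.append(message)
        total += size
    return list(reversed(kept))  # restore chronological order
```

Raising on an unknown model rather than falling back to a default makes a missing entry in the JSON file visible as soon as a new model is added.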