NousResearch
/

Hermes-2-Pro-Mistral-7B

Text Generation

function calling

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

teknium commited on Mar 13

Commit

0a29d8b

•

1 Parent(s): 174f590

Update README.md

Files changed (1) hide show

README.md +20 -0

README.md CHANGED Viewed

@@ -235,6 +235,26 @@ Average: 41.65
 |             |       |mc2   |0.5911|±  |0.0158|
 ```
 # Inference Code
 Here is example code using HuggingFace Transformers to inference the model (note: in 4bit, it will require around 5GB of VRAM)

 |             |       |mc2   |0.5911|±  |0.0158|
 ```
+# Function Calling Evaluations
+We worked with Fireworks.AI on evaluations by starting off with their Function Calling eval dataset, fixing some unsolveable ones, and generating a second eval dataset for JSON mode.
+## Function Calling Accuracy: 91%
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/XF3Zii4-QhE2yjWwHr_v4.png)
+## JSON Mode Accuracy: 84%
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/8H2iyjh5wyP2FtLq2LCed.png)
+Run the evaluator yourself using @interstellarninja's codebase here:
+https://github.com/interstellarninja/function-calling-eval
+You can find the evaluation datasets here:
+https://huggingface.co/datasets/NousResearch/func-calling-eval
+https://huggingface.co/datasets/NousResearch/json-mode-eval
 # Inference Code
 Here is example code using HuggingFace Transformers to inference the model (note: in 4bit, it will require around 5GB of VRAM)