Update README.md
README.md
CHANGED
@@ -23,7 +23,10 @@ Evaluated against the benchmark test: [RAG-Instruct-Benchmark-Tester](https://
 --Summarization Quality (1-5): 4 (Above Average)
 --Hallucinations: No hallucinations observed in test runs.
 
-For test run results (and good indicator of target use cases), please see the files ("core_rag_test" and "answer_sheet" in this repo).
+For test run results (and a good indicator of target use cases), please see the files ("core_rag_test" and "answer_sheet") in this repo.
+
+Note: compare results with [bling-phi-2](https://www.huggingface.co/llmware/bling-phi-2-v0), and [dragon-mistral-7b](https://www.huggingface.com/llmware/dragon-mistral-7b-v0).
+
 
 ### Model Description
 
@@ -79,8 +82,7 @@ Load in your favorite GGUF inference engine, or try with llmware as follows:
 
 # to load the model and make a basic inference
 model = ModelCatalog().load_model("llmware/bling-phi-3-gguf", temperature=0.0, sample=False)
-response = model.
-
+response = model.inference(query, add_context=text_sample)
 
 Details on the prompt wrapper and other configurations are in the config.json file in the files repository.
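The snippet in the diff loads the model with `temperature=0.0, sample=False`, i.e. deterministic greedy decoding. A minimal pure-Python sketch of why those settings pin the output (this is not llmware code; the `logits` values and function names are made up for illustration):

```python
import math
import random

def softmax(logits, temperature):
    # Temperature rescales logits before normalization; as temperature -> 0,
    # the distribution collapses onto the highest-logit token.
    scaled = [l / max(temperature, 1e-8) for l in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def pick_token(logits, temperature=0.0, sample=False, rng=None):
    # sample=False (or temperature=0.0): greedy argmax -- the same token
    # is chosen on every call, so generation is repeatable.
    if not sample or temperature == 0.0:
        return max(range(len(logits)), key=lambda i: logits[i])
    # sample=True: draw from the temperature-scaled distribution,
    # so repeated calls can return different tokens.
    probs = softmax(logits, temperature)
    return (rng or random).choices(range(len(logits)), weights=probs)[0]

logits = [1.0, 3.5, 2.0]   # hypothetical next-token scores
print(pick_token(logits))  # greedy: always the argmax index
```

Greedy decoding is the usual choice for fact-based RAG inference, where repeatable answers matter more than output variety.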