empower-dev
/

llama3-empower-functions-small

Text Generation

function-calling

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

liuylhf commited on May 17, 2024

Commit

38b30aa

·

verified ·

1 Parent(s): 78d3071

Update README.md

Files changed (1) hide show

README.md +47 -3

README.md CHANGED Viewed

@@ -1,3 +1,47 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+tags:
+- function
+- function-calling
+- tool-using
+---
+## Empower Functions Model
+[https://github.com/empower-ai/empower-functions](https://github.com/empower-ai/empower-functions)
+Empower Functions is a family of LLMs(large language models) that offer GPT-4 level capabilities for real-world "tool using" use cases, with full compatibility support to be served as a drop-in replacement.
+## Table of Contents
+## Key Features
+* Automatic tool using, able to decide when to use tools and when to converse, optimized for long conversations
+* Parallel call, supports calling one function multiple times, multiple functions, or a combination of both
+* Sequential calling, supports calling multiple functions sequentially to fulfill the user request
+* Streaming
+## Family of Models
+| Model                    | Specs                                                                                             | Links                  |                                     |
+| ------------------------ | ------------------------------------------------------------------------------------------------- | ---------------------- | ----------------------------------- |
+| empower-functions-small  | 8k context, based on [Llama3 8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B)               | model, GGUF, GGUF 4bit | most cost effective, local runnable |
+| empower-functions-medium | 32k context, based on [Mixtral 8x7B](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) | model                  | balance in accuracy and cost        |
+| empower-functions-large  | 65k context, based on [Mixtral 8x22B](https://huggingface.co/mistralai/Mixtral-8x22B-v0.1)        | model                  | best accuracy                       |
+### Hardware Requirement
+We have tested and the family of models in following setup:
+- empower-functions-small: fp16 on 1xA100 40G, GGUF and 4bit GGUF on Macbook M2 Pro with 32G RAM, in minimal the 4bit GGUF version requires 7.56G RAM.
+- empower-functions-medium: fp16 on 2xA100 80G
+- empower-functions-large: fp16 on 4xA100 80G
+## Usage
+There are three ways to use the empower-functions model. You can either directly [prompt the raw model](https://github.com/empower-ai/empower-functions?tab=readme-ov-file#prompt-raw-model), run it [locally](https://github.com/empower-ai/empower-functions?tab=readme-ov-file#running-locally) through llama-cpp-python, or using our [hosted API](https://github.com/empower-ai/empower-functions?tab=readme-ov-file#using-empower-api)
+## Demo App
+Check our healthcare appointment booking [demo](https://app.empower.dev/chat-demo)
+Want to customize the model? Please contact us at [founders@empower.dev](mailto:founders@empower.dev)