SeanLee97 michaelfeil commited on
Commit
2c6b2a9
1 Parent(s): 584fb28

Add infinity as example deployment (#22)

Browse files

- Add infinity as example deployment (b44b442663045dcafdbdc54389417cd5ba6ffe2d)


Co-authored-by: Michael <michaelfeil@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +8 -0
README.md CHANGED
@@ -2701,6 +2701,14 @@ for dv in doc_vecs:
2701
  ```
2702
 
2703
 
 
 
 
 
 
 
 
 
2704
 
2705
  # Citation
2706
 
 
2701
  ```
2702
 
2703
 
2704
+ ## 3. Infinity
2705
+
2706
+ [Infinity](https://github.com/michaelfeil/infinity) is a MIT licensed server for OpenAI-compatible deployment.
2707
+ ```
2708
+ docker run --gpus all -v $PWD/data:/app/.cache -p "7997":"7997" \
2709
+ michaelf34/infinity:0.0.68 \
2710
+ v2 --model-id WhereIsAI/UAE-Large-V1 --revision "369c368f70f16a613f19f5598d4f12d9f44235d4" --dtype float16 --batch-size 32 --device cuda --engine torch --port 7997
2711
+ ```
2712
 
2713
  # Citation
2714